
Data Analytics - ETL - BI - Cloud Platforms Specialist

Location:
Overland Park, KS, 66223
Salary:
75000
Posted:
February 03, 2026


Resume:

Rakshitha Garrepelly

KS 816-***-**** *********.*******@*****.*** LinkedIn

Summary

Data Analyst with about four years of experience delivering end-to-end data solutions across cloud-native analytics, ETL development, data warehousing, BI reporting, and compliance analytics in industries including BFSI, marketing, and finance. Skilled in SQL, Python (pandas, NumPy, PySpark), Snowflake, Informatica PowerCenter, dbt, Apache Airflow, and AWS (Glue, S3, Lambda, CloudWatch), with expertise in data modeling, governance, quality frameworks, and performance tuning. Proven ability to collaborate with cross-functional teams in agile environments to design scalable data pipelines, automate workflows, and develop actionable dashboards, driving measurable business impact and regulatory compliance.

Technical Skills

Programming Languages: Python (pandas, NumPy, PySpark, openpyxl), SQL (T-SQL, PL/SQL), PowerShell, VBA Macros

ETL & Data Integration: dbt, AWS Glue, Apache Airflow, Informatica PowerCenter, Custom Python ETL, MSSQL ETL, Structured & Semi-Structured Data Handling (CSV, JSON, Parquet, ORC)

Data Warehousing: Snowflake, Oracle, MS SQL Server, Data Modeling (Star/Snowflake Schema), Data Normalization/Denormalization, Partitioning, Indexing Strategies

Data Visualization & BI: Power BI (DAX, Power Query), Tableau, Excel (PivotTables, Power Query, Advanced Formulas)

Cloud Platforms: AWS (EC2, S3, Glue, Lambda, IAM, CloudWatch), Cloud-Native Data Platforms

Data Governance & Quality: Great Expectations, Collibra, Informatica Data Quality (IDQ), Access Control Policies, Data Profiling, Metadata Management, Data Lineage Documentation

Data Processing Frameworks: PySpark, pandas, NumPy, SQL Optimization Techniques, Stored Procedures

Statistical Analysis: KPI Evaluation, A/B Testing, Hypothesis Testing, SciPy, scikit-learn, XGBoost, Regression Models, Time-Series Analysis

Collaboration & Agile: Agile/Scrum Methodologies, Jira, Confluence, SharePoint, Cross-Functional Team Collaboration

Experience

Elevance Health, Data Analyst, TX Aug 2024 – Present

Built production ETL pipelines using Python (pandas, NumPy) and dbt in Snowflake to process 2M+ member behavioral data points for Sydney Health app, increasing member engagement by 25%.

Constructed advanced SQL queries (T-SQL, PL/SQL) with star schema data modeling across 500K+ patient records using Whole Health Index framework, enabling 45K+ preventive interventions and 12% HEDIS improvement.

Discovered denial prediction patterns in 3M+ claims leveraging scikit-learn and XGBoost regression models, reducing denial rates by 68% through automated prior authorization AI across business units.

Aggregated longitudinal member data using PySpark and multi-dimensional data profiling across clinical, behavioral, and social determinants, segmenting 120K high-risk patients and reducing readmissions by 15%.

Implemented data quality frameworks using Great Expectations and Collibra metadata management achieving 99% accuracy in member matching while ensuring GDPR/HIPAA compliance in cloud environments.

Benchmarked bias and accuracy metrics using scikit-learn across demographic cohorts for member risk models, validating responsible AI principles and supporting enterprise OpenAI chatbot deployment.

Composed unified member data architecture by integrating claims, clinical, and genomic data using dbt and snowflake schema modeling in HealthOS, reducing prior authorization time from 48 to 2 hours.

Orchestrated real-time data pipelines using Apache Airflow and AWS Glue streaming member health alerts to Epic EHR, generating 150+ daily care gap notifications across 92% of Medicare Advantage membership.

Hexaware Technologies, Data Analyst, India Jun 2020 – Dec 2022

Analyzed multi-source banking, insurance, and compliance datasets from Oracle, MS SQL Server, and flat file feeds, identifying transactional anomalies and reconciling discrepancies worth $2M+ in high-risk accounts.

Developed and optimized ETL workflows using Informatica PowerCenter and SQL to integrate core banking, claims, and policy data into a centralized warehouse, ensuring daily SLA compliance for regulatory reporting.

Built operational and compliance dashboards in Power BI and Excel (PivotTables, Power Query) to track KYC/AML metrics, policy lapse rates, and claims processing times, enabling executives to meet Basel III & AML directives.

Automated monthly and quarterly risk reporting with Python (pandas, openpyxl) and VBA macros, reducing report preparation time from 3 days to 6 hours and eliminating manual reconciliation errors.

Implemented business rules and data quality checks using SQL constraints, data profiling scripts, and Informatica Data Quality (IDQ) to maintain 99%+ accuracy in regulatory submissions.

Collaborated with compliance officers, underwriters, and IT teams across Agile sprints to enhance fraud detection models, integrating third-party fraud watchlist APIs for real-time verification.

Conducted root cause analysis for SLA breaches using Jira Service Desk and log analysis tools, leading to 20% faster resolution times for critical compliance incidents.

Prepared documentation for ETL mappings, data lineage, and governance workflows in Confluence and SharePoint, ensuring audit readiness for internal and external inspections.

Projects

Music Store Business Analysis – SQL (2025)

Developed a series of SQL queries to analyze sales performance, customer segments, and product revenue trends. Identified high-value customers and top-selling items, highlighting growth opportunities.

Power BI Survey Breakdown – Data Analytics (2025)

Built an interactive Power BI dashboard analyzing survey responses from data professionals. Cleaned and modeled data using Power Query and DAX to uncover trends in salaries, job roles, and tool usage across demographics.

Crypto API ETL Automation – Python (2025)

Developed a Python-based ETL script that automatically pulls real-time cryptocurrency data from public APIs. Parsed, cleaned, and stored structured outputs for analysis, demonstrating practical experience in API ingestion and automation.

Crowd Density Monitoring – Machine Learning & Computer Vision (2021)

Built a YOLO + OpenCV model to detect crowd density in real time and trigger alerts when thresholds were exceeded.

Certificates

Microsoft Azure Fundamentals – Microsoft

Machine Intelligence and Brain Research – IIT Madras

Data Analytics Essentials – CISCO

Google Analytics Certification – Google

SQL (Basic, Intermediate & Advanced) Certification – HackerRank

Cybersecurity Job Simulation – Mastercard

Data Visualization: Empowering Business with Effective Insights Job Simulation – TATA Group

Education

Master’s in Computer Science, University of Central Missouri, MO Jan 2023 – May 2024

Bachelor’s in Electronics and Communication Engineering, JNTUH, India Aug 2017 – Jul 2021


