Ruchitha Kurapati
Overland Park, KS ***** ***************@*****.*** +1-913-***-****
Professional Summary
● Data Engineer with 4+ years of hands on experience building scalable data pipelines, cloud data platforms, and ETL frameworks in AWS and Azure environments.
● Proficient in Python, SQL, Azure Synapse, Databricks, AWS Redshift/S3, Kafka, and Apache Spark. Consistently delivered automated, high quality data solutions that improved processing efficiency, data quality, and stakeholder decision making.
● Seeking Data Engineering roles focusing on cloud infrastructure, streaming, and end to end data architecture. Technical Skills
Platforms & Tools: AWS (S3, Redshift, Glue), Azure (Data Factory, Synapse Analytics, Databricks, Stream Analytics)
Programming & ETL: Python, SQL, Apache PySpark, Apache Airflow Big Data: Apache Kafka, Apache Spark, Hadoop ecosystem Data Modeling & Storage: OLAP/OLTP, Star/Snowflake schema, Snowflake, Azure Data Lake, Redshift
DevOps & CI/CD: Azure DevOps, Git, Terraform (optional mention if applicable) Data Quality & Governance: Metadata management, compliance, testing frameworks BI & Visualization: Power BI, Tableau
Work Experience
Data Engineer
Aetna, MO 01/2025 – Present
● Developed and managed automated ETL pipelines using Python, SQL, and Azure Data Factory to ingest and transform data from on premise and SaaS sources, reducing manual hand offs by 70%.
● Built streaming ingestion workflows with Azure Stream Analytics and Apache Kafka to support real time dashboards with < 1 second latency.
● Implemented data governance processes including schema versioning, metadata pipelines, and automated data quality validation, resulting in a 30% reduction in data errors.
● Designed rest APIs and data models in Azure Synapse Analytics for consistent historical and incremental reporting.
● Conducted A/B testing to evaluate the impact of website changes on user engagement, leading to a 15% increase in conversion rates.
● Created interactive dashboards in Tableau and Power BI to monitor KPIs and pipeline performance, accelerating stakeholder decision-making cycles by 25%.
● Collaborated with cross functional teams to architect data warehousing solutions and optimize Spark based transformations, improving query efficiency by 35%. Azure Data Engineer
DXC Technology, India 12/2019 – 07/2023
● Architected and deployed automation via Azure Data Factory pipelines to orchestrate data workflows into Azure Data Lake Storage, Azure Blob Storage, and centralized warehouse.
● Engineered ETL jobs in Databricks (Python, PySpark) to standardize and cleanse large datasets, reducing processing time by 20%.
● Implemented DAX calculations for time series analysis (YoY, MAs) to support business analytics and trend reporting.
● Set up Azure Virtual Networks (VNet) and managed Azure VMs for secure, scalable computing resources.
● Led metadata and data quality initiatives—identifying, tracking, and resolving data defects and mismatches in collaboration with QA and stakeholders.
● Trained junior analysts on ETL best practices and onboarding to Azure tools, improving team productivity and data maturity.
● Developed hands-on experience with Azure SQL Database and Azure Stream Analytics for data processing.
● Performed data mining and statistical analysis to identify trends and patterns in large datasets.
● Cleaned and transformed raw data to ensure accuracy and consistency for reporting purposes.
● Supported the development of data models to facilitate efficient data storage and retrieval. Education
Masters of Science in Computer Science
University of Central Missouri, Warrensburg, MO 08/2023 – 05/2025 Certifications
● Microsoft Certified: Azure Fundamentals (AZ-900)
● Microsoft Certified: Azure Data Fundamentals (DP-900)
● Microsoft Certified: Azure Data Engineer Associate (DP-203)
● Microsoft Certified: Power BI Data Analyst (PL-300)