Chaitanya Dhulipudi
Data Engineer
+1-201-***-**** ***************.**@*****.***
Professional Summary
Results-oriented Data Engineer with 3+ years of experience and a solid foundation in Python, SQL, and data pipeline development. Skilled in building and optimizing ETL workflows that transform raw data into clean, usable datasets. Experienced with Snowflake, AWS (S3, DynamoDB), and Pandas for scalable data management. Strong analytical mindset with hands-on exposure to large healthcare datasets, focusing on data validation, quality, and automation. Eager to contribute to a high-performing engineering team by delivering efficient, reliable, and secure data solutions in the healthcare domain.
Key Skills
Data Engineering & Databases
ETL pipeline development using Python, SQL, and Snowflake
Data modeling, cleaning, and transformation
Query optimization and database performance tuning
Programming & Tools
Python (NumPy, Pandas, PySpark), Shell scripting
AWS (S3, Lambda, Step Functions, DynamoDB, Batch)
Git, Jupyter, VS Code, ServiceNow
Analytics & Data Quality
Data validation, testing, and debugging
Data documentation and reporting
Strong understanding of healthcare data (claims, clinical, eligibility)
Professional Experience
Databricks Lead Developer – Mayo Clinic, Rochester, MN July 2024 – Present
Description:
At Mayo Clinic, contributed to the development of scalable, cloud-based data systems supporting healthcare analytics. Focused on automating ETL processes, improving data quality, and ensuring compliance with internal standards. Gained practical experience working with Snowflake, AWS data services, and healthcare data pipelines, laying a strong foundation for a full-time Data Engineer role.
Responsibilities:
Assisted in designing and maintaining ETL data pipelines using Python and SQL to process and transform healthcare datasets.
Supported data ingestion workflows from multiple sources into Snowflake and validated schema consistency across environments.
Developed data quality checks and basic automation scripts to identify anomalies, missing values, and discrepancies in healthcare claims data.
Collaborated with senior engineers and analysts to optimize performance and scalability of large datasets for analytics and reporting.
Participated in code reviews, unit testing, and documentation to ensure adherence to internal data governance standards.
Monitored AWS S3 and DynamoDB storage performance, ensuring timely data refresh cycles and availability for downstream users.
Data Engineer – Inteq India Mar 2021 – Nov 2022
Description:
Contributed to enterprise data modernization projects by migrating on-premises Teradata/Oracle workloads to Azure Synapse, building ADF/Databricks pipelines, and implementing real-time event streaming (Azure Event Hubs). Delivered cost and billing datasets, designed data lineage frameworks for audit compliance, and supported BI reporting via Power BI dashboards. Automated workflows in Autosys, tuned Spark/SQL jobs, and worked closely with senior engineers to design scalable cloud warehouse solutions. Ensured HIPAA/GDPR compliance across ETL environments.
Responsibilities:
Migrated Teradata/Oracle workloads into Azure Synapse and Snowflake, modernizing enterprise data infrastructure with minimal downtime.
Designed and tuned ADF/Databricks pipelines to process large, complex datasets, reducing runtime by 30% and improving reporting efficiency.
Created star schemas, fact/dimension tables, and curated datasets to support BI dashboards and predictive analytics.
Partnered with finance and reporting teams to deliver Power BI dashboards and datasets for cost, billing, and operational insights.
Automated scheduling and monitoring using Airflow, Autosys, and Jenkins, ensuring stable and repeatable pipeline performance.
Participated in data governance initiatives, maintaining compliance with HIPAA/GDPR standards and enhancing audit readiness.
Supported analysts by preparing datasets, validating BI outputs, and troubleshooting SQL/Spark queries for accuracy.
Education
Master’s in Data Science – Montclair State University
Relevant Coursework: Cloud Computing, Database Systems, Big Data Analytics, Cybersecurity in Cloud
Bachelor’s in Electrical & Electronics Engineering – Vasireddy Venkatadri Institute of Technology
Certifications
AWS Certified Data Engineer – Associate
Microsoft Certified: Azure Data Engineer Associate