Data Engineer Quality

Location:
Jersey City, NJ
Salary:
35
Posted:
October 12, 2025

Resume:

Chaitanya Dhulipudi

Data Engineer

+1-201-***-**** ***************.**@*****.***

Professional Summary

Results-oriented Data Engineer with 3+ years of experience and a solid foundation in Python, SQL, and data pipeline development. Skilled in building and optimizing ETL workflows that transform raw data into clean, usable datasets. Experienced with Snowflake, AWS (S3, DynamoDB), and Pandas for scalable data management. Strong analytical mindset with hands-on exposure to large healthcare datasets, focusing on data validation, quality, and automation. Eager to contribute to a high-performing engineering team by delivering efficient, reliable, and secure data solutions in the healthcare domain.

Key Skills

Data Engineering & Databases

ETL pipeline development using Python, SQL, and Snowflake

Data modeling, cleaning, and transformation

Query optimization and database performance tuning

Programming & Tools

Python (NumPy, Pandas, PySpark), Shell scripting

AWS (S3, Lambda, Step Functions, DynamoDB, Batch)

Git, Jupyter, VS Code, ServiceNow

Analytics & Data Quality

Data validation, testing, and debugging

Data documentation and reporting

Strong understanding of healthcare data (claims, clinical, eligibility)

Professional Experience:

Databricks Lead Developer – Mayo Clinic, Rochester, MN July 2024 – Present

Description:

At Mayo Clinic, contributed to the development of scalable, cloud-based data systems supporting healthcare analytics. Focused on automating ETL processes, improving data quality, and ensuring compliance with internal standards. Gained practical experience working with Snowflake, AWS data services, and healthcare data pipelines, laying a strong foundation for a full-time Data Engineer role.

Responsibilities:

Assisted in designing and maintaining ETL data pipelines using Python and SQL to process and transform healthcare datasets.

Supported data ingestion workflows from multiple sources into Snowflake and validated schema consistency across environments.

Developed data quality checks and basic automation scripts to identify anomalies, missing values, and discrepancies in healthcare claims data.

Collaborated with senior engineers and analysts to optimize performance and scalability of large datasets for analytics and reporting.

Participated in code reviews, unit testing, and documentation to ensure adherence to internal data governance standards.

Monitored AWS S3 and DynamoDB storage performance, ensuring timely data refresh cycles and availability for downstream users.

Data Engineer – Inteq India Mar 2021 – Nov 2022

Description:

Contributed to enterprise data modernization projects by migrating on-premises Teradata/Oracle workloads to Azure Synapse, building ADF/Databricks pipelines, and implementing real-time event streaming with Azure Event Hubs. Delivered cost and billing datasets, designed data lineage frameworks for audit compliance, and supported BI reporting via Power BI dashboards. Automated workflows in Autosys, tuned Spark/SQL jobs, and worked closely with senior engineers to design scalable cloud warehouse solutions. Ensured HIPAA/GDPR compliance across ETL environments.

Responsibilities:

Migrated Teradata/Oracle workloads into Azure Synapse and Snowflake, modernizing enterprise data infrastructure with minimal downtime.

Designed and tuned ADF/Databricks pipelines to process large, complex datasets, reducing runtime by 30% and improving reporting efficiency.

Created star schemas, fact/dimension tables, and curated datasets to support BI dashboards and predictive analytics.

Partnered with finance and reporting teams to deliver Power BI dashboards and datasets for cost, billing, and operational insights.

Automated scheduling and monitoring using Airflow, Autosys, and Jenkins, ensuring stable and repeatable pipeline performance.

Participated in data governance initiatives, maintaining compliance with HIPAA/GDPR standards and enhancing audit readiness.

Supported analysts by preparing datasets, validating BI outputs, and troubleshooting SQL/Spark queries for accuracy.

Education

Master’s in Data Science, Montclair State University

Relevant Coursework: Cloud Computing, Database Systems, Big Data Analytics, Cybersecurity in Cloud

Bachelor’s in Electrical & Electronics Engineering, Vasireddy Venkatadri Institute of Technology

Certifications

AWS Certified Data Engineer Associate

Microsoft Certified Azure Data Engineer Associate


