CHINNABABULREDDY VARRA
DATA ENGINEER
USA +1-763-***-**** *********************@*****.***
LinkedIn: https://www.linkedin.com/in/chinnababul-varra-988272357/
SUMMARY
Data Engineer with 4+ years of experience building scalable ETL/ELT pipelines using Azure (ADF, Databricks, Delta Lake), AWS (Glue, EMR, S3), and Snowflake. Strong expertise in PySpark, SQL, Kafka, Airflow, and Lakehouse architectures. Proven success improving pipeline performance by 30–50%, optimizing cloud costs, and delivering real-time analytics for healthcare and financial domains. Experienced in CI/CD (GitLab, Jenkins, Terraform) and modern data engineering best practices.
TECHNICAL SKILLS
Methodologies: SDLC, Agile, Waterfall
Programming Languages: Python, R, SQL, Scala
IDEs: PyCharm, Jupyter Notebook
Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, TensorFlow
Databases: MySQL, SQL Server, PostgreSQL, Oracle
Big Data Ecosystem: Hadoop, MapReduce, Apache Spark, Sqoop, PySpark
ETL & Orchestration: Apache Airflow, Apache NiFi, SSIS, Talend, Informatica
Reporting Tools: Power BI, Tableau, SSRS, QuickSight
DevOps & Version Control Tools: Jenkins, GitLab, Terraform, Docker, Git, GitHub
Other Skills: Data Modeling (Star, Snowflake), Data Lakehouse Architecture, Delta Live Tables, Microsoft Fabric, dbt
WORK EXPERIENCE
BANK OF AMERICA, USA Data Engineer Feb 2025 – PRESENT
● Developed and optimized automated ETL pipelines using Talend, SSIS, and Hadoop, improving data flow efficiency by 30% while maintaining 99% data accuracy.
● Designed and deployed scalable data pipelines on AWS (S3, EC2, Lambda, Glue), reducing processing time by 40% and improving pipeline reliability.
● Built 50+ data visualizations using Matplotlib and Seaborn to simplify technical insights for business teams.
● Created and delivered 30+ BI reports using Tableau and SSRS, increasing data-driven decision-making efficiency by 25%.
● Managed multi-cloud architectures across AWS S3, Azure Data Lake, and GCP BigQuery, enabling secure and cost-effective analytics solutions.
● Automated ingestion and transformation workflows using Apache Airflow, reducing manual efforts by 50%.
● Implemented Great Expectations for automated data validation, reducing data quality issues by 40%.
● Designed a cost-optimized Snowflake architecture, reducing compute and storage costs by 25%.
CIGNA HEALTHCARE, USA Data Engineer Aug 2024 – Jan 2025
● Implemented Delta Lake on Azure Data Lake and AWS S3, reducing storage redundancy by 30% through version-controlled storage.
● Improved performance of petabyte-scale clinical datasets, enabling faster decision-making for healthcare providers.
● Ensured full compliance with HIPAA and data governance policies throughout data engineering processes.
● Automated CI/CD pipelines using GitLab, Jenkins, and Terraform, reducing deployment time by 50%.
● Enhanced clinical reporting efficiency by reducing Snowflake and BigQuery query times by 35%.
● Built scalable ETL pipelines using ADF, Databricks, and PySpark, processing 5+ TB of daily healthcare data.
Nefroverse Technologies, India Jr. Data Engineer Jun 2021 – Aug 2023
● Built and optimized ETL pipelines using Python, SQL, and Hadoop, increasing processing efficiency by 40%.
● Designed real-time streaming architecture using Apache Kafka, improving pharmacy data quality by 30%.
● Migrated on-premises SSIS workflows to AWS Glue and S3, significantly reducing infrastructure costs.
● Improved batch processing on AWS EMR (Spark), cutting pipeline execution time by 30%.
● Developed CI/CD workflows using Jenkins and Docker, reducing deployment errors by 25%.
● Built analytics dashboards in QuickSight, enabling faster healthcare KPI insights.
EDUCATION
Master of Science - Information Technology and Management, Concordia University, Saint Paul, USA
Bachelor of Technology - Computer Science & Engineering, Kalasalingam University of Research and Education
Data Science - 4-year course by IBM
CERTIFICATIONS
Azure Data Engineer Associate
AWS Certified Data Engineer
DBT Data Modeling
Databricks Lakehouse Fundamentals