Data Engineer - ETL/ELT & Data Pipeline Expert

Location:

United States

Salary:

Posted:

May 01, 2026

Contact this candidate

Resume:

Vikas Reddy Amaravathi

Phone: +1-210-***-**** Email: **********@*****.*** LinkedIn: LinkedIn

PROFESSIONAL SUMMARY

Data Engineer with 4+ years of experience in building scalable ETL/ELT and real-time data pipelines across healthcare, financial services, and payments domains. Proficient in Python, Shell Scripting, and Oracle development, with a strong backend focus on processes and architecture improvements. Expertise in Linux environment setup, Unix file systems, and data warehousing using Snowflake and AWS. Skilled in automation, process improvement, and data governance, with practical experience in Airflow and Informatica for orchestration and ETL processes. Proven ability to enhance Linux-based toolsets and scripts, manage data flows, and support AI/ML pipelines. Adept at collaborating with cross-functional teams to deliver high-quality data engineering solutions, utilizing Agile methodology and infrastructure as code practices. TECHNICAL SKILLS

Programming Languages: Python, SQL, Scala, Shell Scripting, Perl Frameworks & Libraries: Apache Spark, PySpark, dbt Cloud & DevOps: AWS (S3, EC2, IAM, Glue), Azure (Data Factory, ADLS), Docker, Kubernetes, Terraform Databases: Oracle, MySQL, PostgreSQL, Snowflake, Amazon Redshift, Oracle Exadata Data Warehousing: Snowflake, Amazon Redshift, Data Lake, Lakehouse Architecture Tools & Technologies: Apache Kafka, Airflow, Informatica, Git, Jenkins, GitHub Actions, orchestration tools AI & Generative AI: Feature Engineering, Predictive Analytics, Machine Learning Pipeline Support Operating Systems: Linux, Unix

Additional Skills: mount types, permissions, standard tools, pipes, infrastructure management, toolsets enhancement, scripts enhancement, database load / extract processes, data lineage, data cataloging PROFESSIONAL EXPERIENCE

Cigna Sep 2025 - Present

Data Engineer Texas, USA

• Designed and maintained scalable ETL / ELT data pipelines for processing large-scale insurance, financial, and customer datasets, focusing on backend processes.

• Built and optimized batch and real-time data ingestion frameworks using Apache Airflow, Python, and Spark, enhancing database load / extract processes.

• Developed data pipelines for ingesting structured and semi-structured data into Snowflake data warehouse, ensuring robust data lineage.

• Implemented data quality frameworks and monitoring mechanisms to ensure data integrity and reliability, utilizing standard tools and permissions.

• Utilized AWS Glue Data Catalog for metadata management and data discovery, supporting data cataloging initiatives.

• Deployed containerized workloads using Docker and Kubernetes to improve scalability and deployment consistency, leveraging infrastructure as code practices.

Charles Schwab Jul 2024 - May 2025

Data Engineer Texas, USA

• Developed and maintained ETL / ELT data pipelines for consumer lending and financial transaction datasets, with a focus on backend infrastructure management.

• Built batch data ingestion frameworks using Python and SQL for centralized data warehouse integration, enhancing scripts and toolsets.

• Managed data pipelines for ingesting data from relational databases into MySQL and PostgreSQL, optimizing database load / extract processes.

• Enhanced ETL workflows using modular SQL transformations and dbt for improved maintainability, focusing on toolsets enhancement.

• Improved query performance through indexing, query tuning, and schema optimization techniques, ensuring efficient use of mount types and pipes.

• Collaborated with cross-functional teams to translate business requirements into scalable data solutions, adhering to Agile methodology.

American Express May 2021 - Jul 2023

Data Engineer Hyderabad, India

• Assisted in developing ETL data pipelines for financial transaction and customer datasets, focusing on backend data warehouses.

• Supported batch data ingestion and transformation processes using Python, SQL, and basic Spark, enhancing scripts and processes.

• Monitored ETL workflows and supported Apache Airflow scheduling for reliable data pipeline execution, utilizing orchestration tools.

• Contributed to building curated datasets for business reporting and analytics, ensuring data lineage and cataloging.

• Supported AWS-based data storage and processing using S3 and EC2, focusing on Linux-based processes.

• Collaborated with senior engineers to understand requirements and support data engineering solutions, improving infrastructure management.

EDUCATION

Master of Science in Computer Science University of North Texas Jan 2023 - July 2025 Denton, TX, USA

Contact this candidate