
Senior Data Engineer GCP & AWS ETL Expert

Location:
Danbury, CT
Posted:
May 01, 2026

Resume:

SRUTHI LENKALA

New Haven, CT 314-***-**** ******************@*****.*** LinkedIn

PROFESSIONAL PROFILE

Senior Data Engineer with 6+ years of hands-on Python experience building and migrating enterprise data pipelines in AWS and GCP. Skilled in using AWS Glue, Spark, and Airflow to design robust ETL workflows, having migrated multiple major workloads and reduced pipeline runtime by 25%. Proven ability to deliver scalable, production-ready solutions that drive analytics and business insights.

PROFESSIONAL EXPERIENCE

The Home Depot Feb 2024 - Present

Senior Data Engineer (GCP Platform) McLean, VA

• Built and maintained 50+ Python-driven production data pipelines on Google Cloud Platform using Cloud Composer, BigQuery, and Dataproc, supporting analytics for 2,000+ stores nationwide

• Reduced average pipeline runtime by 25% through query optimization and proper partitioning strategies in BigQuery

• Designed a configuration-driven DAG factory framework in Airflow that cut onboarding time for new data feeds from 3 days to under 4 hours

• Implemented Change Data Capture (CDC) and incremental load patterns using BigQuery stored procedures, improving financial reporting data freshness from daily to near-real-time

• Created partitioned and clustered BigQuery tables based on actual query patterns, lowering monthly query costs by approximately 15%

• Led the data validation effort for a major content migration project, reconciling over 10TB of inventory and sales data between legacy systems and BigQuery with 99.9% accuracy

• Established row-level security and authorized views in BigQuery, enabling secure data sharing between analytics, finance, and supply chain teams

• Served in weekly on-call rotations, troubleshooting pipeline failures across GCP and AWS environments and performing data backfills with minimal business disruption

• Collaborated directly with business analysts to design datasets that now power executive dashboards tracking $150B+ in annual revenue

WalkingTree Technologies Jun 2020 - Aug 2022

Data Analytics Engineer Hyderabad, India

• Supported enterprise analytics pipelines for financial risk modeling, loan performance tracking, and regulatory reporting for a BFSI client.

• Migrated 3 major data workloads from Redshift to Snowflake using Glue, improving query performance by 30% while reducing storage costs

• Developed Spark-based data transformation jobs using PySpark, processing 18+ records monthly for data cleansing and business rule application

• Designed and implemented SCD Type-2 dimension tables with historical tracking for critical financial hierarchies

• Built 15+ Airflow DAGs to orchestrate batch processing across multiple business units with complex dependencies

• Automated manual file ingestion processes using Python and Cloud Functions, saving approximately 20 hours of manual effort monthly

• Provided weekly production support in a regulated financial environment, resolving data discrepancies and pipeline issues

• Collaborated with upstream source system teams and downstream analysts to ensure data accuracy and timeliness

Aurora e-Labs Pvt. Ltd Apr 2018 - May 2020

Big Data Engineer Hyderabad, India

• Built Hadoop ingestion pipelines using Sqoop to extract data from Oracle/MySQL into HDFS for downstream analytics.

• Developed Spark/PySpark transformation jobs applying business rules for customer analytics use cases.

• Designed Hive data models leveraging partitioning and bucketing to optimize large-scale reporting queries.

• Automated batch ETL workflows using Oozie to improve job orchestration reliability.

• Loaded curated datasets into HBase to enable low-latency access for customer-facing applications.

• Provided production support for distributed Hadoop jobs, implementing monitoring and resolving pipeline failures.

EDUCATION

University of New Haven 2022 - 2023

Master of Science, Business Analytics West Haven, CT

Jawaharlal Nehru Technological University 2012 - 2016

Bachelor of Technology, Electrical & Electronics Engineering Hyderabad, India

CERTIFICATIONS

• Google Cloud Certified - Professional Data Engineer (2022)

• AWS Certified Data Analytics - Specialty (2021)

TECHNICAL EXPERTISE

• CLOUD PLATFORMS: BigQuery, Cloud Composer (Airflow), Dataproc, Cloud Functions, Cloud Run, GCS, IAM, VPC, S3, EMR, Glue, Lambda, Redshift, Kinesis, SQS, CloudFormation, AWS

• BIG DATA PROCESSING: Spark (Core, SQL, PySpark), Hadoop (CDH), Hive, Sqoop, Oozie, Data Pipeline Orchestration, ETL/ELT Design, Data Modeling, Content Migration

• DATA WAREHOUSING: BigQuery, Snowflake, Redshift, Oracle, MySQL, PostgreSQL, HBase

• PROGRAMMING & DEVOPS: Python, SQL, PySpark, Bash/Shell Scripting, Core Java, Git, Jenkins, Terraform, Docker, CI/CD Pipelines

• VISUALIZATION & COLLABORATION: Tableau, Looker Studio, Power BI, Jira, Confluence, Agile/Scrum

KEY ACHIEVEMENTS

• Built 50+ production data pipelines supporting analytics for 2,000+ stores

• Reduced pipeline runtime by 25% through optimization strategies

• Cut data feed onboarding time from 3 days to under 4 hours

• Lowered BigQuery monthly costs by approximately 15%

• Validated migration of 10TB+ data with 99.9% accuracy


