Data Engineer Engineering

Location:
Texas
Posted:
October 30, 2025

Resume:

HEMENDRA GUMMADI

Dallas, TX 503-***-**** ***************@*****.*** hemendra-gummadi

SKILLS

Programming & Scripting: SQL, Python (Pandas, Boto3), PySpark, Shell Scripting, JavaScript, Scala, HTML/CSS

Data Engineering & Cloud Technologies: AWS (Glue, Lambda, S3, EC2, Redshift, Kinesis, DynamoDB, API Gateway, IAM, RDS), Snowflake, Databricks, Apache Spark, Apache Airflow, dbt, SQL Server, PostgreSQL, MongoDB, Data Modeling (Star Schema), ETL/ELT Development, Data Warehousing, Data Lakes, API Integration, Tableau

Other Skills: Excel, CI/CD for Data Pipelines, Version Control (Git, GitHub), Docker, Monitoring & Alerting (CloudWatch, Airflow UI), Workflow Scheduling, Agile Development Practices, Jira, Stakeholder Collaboration & Documentation

EDUCATION

The University of Texas at Dallas, M.S. Business Analytics – Data Science (Concentration), 2023

EXPERIENCE

Match4Action-CrowdDoing Dallas, TX

Data Engineer Mar 2024 – Mar 2025

● Built scalable data pipelines using AWS Glue and Python to extract data from RDS and ingest it into S3, handling 10M+ records weekly and reducing data delivery time to downstream systems by 45%.

● Transformed raw data using dbt and SQL within Snowflake, creating clean and reusable data models that improved reporting accuracy and reduced analyst turnaround time by 30%.

● Developed and maintained high-volume data transformation pipelines using PySpark on Databricks, processing over 500M records monthly and enabling scalable analytics for product, finance, and operations teams, cutting ad-hoc reporting requests by 35%.

● Orchestrated ETL workflows using Airflow and Lambda, enabling automated scheduling, failure alerts, and retry logic, resulting in 99% on-time data pipeline execution across business-critical workflows.

● Integrated Tableau dashboards with Snowflake, giving stakeholders near real-time access to metrics and trends, which improved business decision-making speed and sales forecasting accuracy by 20%.

● Optimized SQL queries and Snowflake warehouse performance using clustering and partitioning strategies, reducing dashboard load times by 60% and saving 25% in compute costs.

● Implemented data governance controls using Glue Data Catalog and dbt tests, ensuring schema consistency, data lineage visibility, and audit compliance, helping 5+ teams rely more confidently on their data.

Archi’s Academy Dallas, TX

Data Analytics Engineer Jul 2023 – Jan 2024

● Engineered scalable ELT pipelines using AWS Glue, S3, and Snowflake to process 1M+ records from Crunchbase, reducing processing time by 30%; orchestrated the workflows with Apache Airflow and optimized Glue job configurations.

● Built and maintained 10+ dbt models, ensuring 99.6% data consistency and enabling data-driven decisions that supported $400K in new funding initiatives.

● Designed serverless pipelines with AWS Lambda and Fargate for real-time and batch workloads, integrating Glue Data Catalog for partitioning and seamless querying through Athena, Redshift Spectrum, and EMR.

● Containerized ETL jobs with Docker and managed CI/CD pipelines through Git and Airflow, while collaborating in Agile sprints using Jira, accelerating deployments and improving testing efficiency by 30%.

● Developed Tableau dashboards that increased stakeholder engagement by 25%, uncovering insights that influenced strategic focus on academic grants.

● Transformed unstructured datasets into actionable intelligence, improving investment prediction accuracy by 40% and directly supporting strategic funding decisions.

HCL Technologies (Google) Hyderabad, India

Data Engineer Mar 2020 – Jul 2021

● Optimized ETL pipelines using GCP tools (BigQuery, Dataflow, Cloud Pub/Sub), reducing data latency by 35% for real-time advertising performance analytics.

● Developed and deployed TensorFlow-based machine learning models for ad revenue forecasting, improving forecast accuracy by 15%.

● Enhanced data warehouse efficiency by implementing partitioning and clustering in BigQuery, boosting query performance by 40%.

● Collaborated with cross-functional teams to create real-time dashboards in Google Data Studio and Looker, enabling data-driven decisions for major clients including Walt Disney and The New York Times.

CERTIFICATIONS & ACHIEVEMENTS

● Graduate certificate in Applied Machine Learning (University of Texas at Dallas).