HEMENDRA GUMMADI
Dallas, TX 503-***-**** ***************@*****.*** hemendra-gummadi
SKILLS
Programming & Scripting: SQL, Python (Pandas, Boto3), PySpark, Shell Scripting, JavaScript, Scala, HTML/CSS Data Engineering & Cloud Technologies: AWS (Glue, Lambda, S3, EC2, Redshift, Kinesis, DynamoDB, API Gateway, IAM, RDS), Snowflake, Databricks, Apache Spark, Apache Airflow, DBT, SQL Server, PostgreSQL, MongoDB, Data Modeling (Star Schema), ETL/ELT Development, Data Warehousing, Data Lakes, API Integration, Tableau Other Skills: Excel, CI/CD for Data Pipelines, Version Control (Git, GitHub), Docker, Monitoring & Alerting (CloudWatch, Airflow UI), Workflow Scheduling, Agile Development Practices, Jira, Stakeholder Collaboration & Documentation EDUCATION
The University of Texas at Dallas, M.S. Business Analytics – Data Science (Concentration)2023 EXPERIENCE
Match4Action-CrowdDoing Dallas, TX
Data Engineer Mar 2024 – Mar 2025
● Built scalable data pipelines using AWS Glue and Python to extract data from RDS and ingest it into S3, handling 10M+ records weekly and reducing data delivery time to downstream systems by 45%.
● Transformed raw data using dbt and SQL within Snowflake, creating clean and reusable data models that improved reporting accuracy and reduced analyst turnaround time by 30%.
● Developed and maintained high-volume data transformation pipelines using PySpark on Databricks, processing over 500M records monthly and enabling scalable analytics for product, finance, and operations teams, cutting ad-hoc reporting requests by 35%.
● Orchestrated ETL workflows using Airflow and Lambda, enabling automated scheduling, failure alerts, and retry logic, resulting in 99% on-time data pipeline execution across business-critical workflows.
● Integrated Tableau dashboards with Snowflake, giving stakeholders near real-time access to metrics and trends, which improved business decision-making speed and sales forecasting accuracy by 20%.
● Optimized SQL queries and Snowflake warehouse performance using clustering and partitioning strategies, reducing dashboard load times by 60% and saving 25% in compute costs.
● Implemented data governance controls using Glue Data Catalog and dbt tests, ensuring schema consistency, data lineage visibility, and audit compliance, helping 5+ teams rely more confidently on their data Archi’s Academy Dallas, TX
Data Analytics Engineer July 2023 – Jan 2024
● Engineered scalable ELT pipelines using AWS Glue, S3, and Snowflake to process 1M+ records from Crunchbase, reducing processing time by 30% and orchestrating workflows using Apache Airflow with optimized Glue job configurations.
● Built and maintained 10+ dbt models, ensuring 99.6% data consistency and enabling data-driven decisions that supported $400K in new funding initiatives.
● Designed serverless pipelines with AWS Lambda and Fargate for real-time and batch workloads, integrating Glue Data Catalog for partitioning and seamless querying through Athena, Redshift Spectrum, and EMR.
● Containerized ETL jobs with Docker and managed CI/CD pipelines through Git and Airflow, while collaborating in Agile sprints using Jira, accelerating deployments and improving testing efficiency by 30%.
● Developed Tableau dashboards that increased stakeholder engagement by 25%, uncovering insights that influenced strategic focus on academic grants.
● Transformed unstructured datasets into actionable intelligence, improving investment prediction accuracy by 40% and directly supporting strategic funding decisions.
HCL Technologies (Google)Hyderabad, India
Data Engineer Mar 2020 – July 2021
● Optimized ETL pipelines using GCP tools (BigQuery, Dataflow, Cloud Pub/Sub), reducing data latency by 35% for real-time advertising performance analytics.
● Developed and deployed TensorFlow-based machine learning models for ad revenue forecasting, improving forecast accuracy by 15%.
● Enhanced data warehouse efficiency by implementing partitioning and clustering in BigQuery, boosting query performance by 40%.
● Collaborated with cross-functional teams to create real-time dashboards in Google Data Studio and Looker, enabling data-driven decisions for major clients including Walt Disney and The New York Times. Certifications & Achievements:
● Graduate certificate in Applied Machine Learning (University of Texas at Dallas).