Gowtam Reddy Gurrala
DATA ENGINEER
Baltimore, MD, *************@*****.***, 443-***-****, LinkedIn
PROFESSIONAL EXPERIENCE
Mckesson MD, USA
Data Engineer Apr 2025 - Current
Implemented and deployed data pipelines with PySpark, Databricks, and Azure Data Factory to process 3+ TB of daily healthcare data.
Combined data from APIs, SQL Server, and Snowflake into cohesive datasets for business analysis and executive reporting.
Collaborated with business stakeholders to establish KPIs, facilitating data-driven business decisions for 8+ business units.
Delivered Delta Lake architecture and Power BI designs, enhancing dashboard performance by 45%.
Executed automated data validation using Great Expectations, making 99% of the data reliable for downstream reports.
Coordinated with finance and operations departments to convert analytical findings into cost reductions
(~$250K per year).
Tools: Azure (ADF, Synapse, Databricks), PySpark, SQL, Power BI, Python, Delta Lake, Snowflake JPMorganChase MD, USA
Data Engineer / Programmer Analyst Sep 2024 - Apr 2025
Developed and operated ETL pipelines in AWS Glue and Airflow, streamlining data workflows in 5 departments.
Created Snowflake data models and dbt transforms for credit risk analysis and regulatory reporting.
Automated data reconciliation with Python and SQL, reducing manual QA time by 70%.
Worked with business analysts, data scientists, and product teams to define and deliver critical datasets for predictive analytics.
Deployed Jenkins-based CI/CD pipelines for ETL code, enhancing release efficiency by 30%.
Collaborated with business teams to onboard metrics tracking on financial transactions (~1M+ records per day).
Tools: AWS (Glue, S3, Redshift, Lambda), Airflow, dbt, Snowflake, Python, SQL, Jenkins, Tableau Trigent Hyderabad, India
Business Systems Analyst May 2020- June 2022
Served as a bridge between business and technical teams to gather requirements for analytics and data warehouse solutions.
Performed data mapping, profiling, and validation for migration projects on SQL Server and Power BI.
Automated KPI tracking reports for customers, enhancing stakeholder visibility by 35%.
Created process documentation, user stories, and UAT scripts through Jira and Confluence.
Reviewed business processes to determine where business processes can be automated utilizing Excel macros and Python scripting.
Tools: SQL Server, Power BI, Jira, Confluence, Excel, Python Adani Hyderabad, India
Business Analyst Junior Mar 2019- Apr 2020
Collected business requirements and translated them into data models and visual dashboards.
Supported operations, logistics, and energy analytics with SQL and Tableau.
Created automated Excel reports and conducted root cause analysis on monthly variance trends.
Collaborated with technical teams to map database schemas to business KPIs.
Tools: SQL, Excel, Tableau, Python (Basic)
TECHNICAL SKILLS
Programming & Scripting: Python (Pandas, NumPy, PySpark, Scikit-learn, SQLAlchemy), SQL (T- SQL, PL/SQL), Scala, R (Basic), Shell Scripting, Bash
Data Engineering & ETL: Databricks, Apache Spark, Delta Lake, Airflow, dbt, Azure Data Factory
(ADF), AWS Glue, Kafka, Snowflake, Synapse Analytics, Informatica, REST APIs, Batch & Streaming Pipelines, CDC (Change Data Capture)
Cloud Platforms: Azure (Synapse, Data Lake, ADF, Databricks), AWS (S3, Redshift, Lambda, Glue), GCP (BigQuery, Cloud Composer)
Databases: Snowflake, SQL Server, PostgreSQL, MySQL, Oracle, MongoDB (Basic), DynamoDB
(Basic)
Data Visualization & Analytics: Power BI (DAX, Power Query), Tableau, Looker, Excel (Pivot, Power Query), Google Data Studio
Business & Process Tools: Jira, Confluence, Lucidchart, Visio, Agile/Scrum, BRD/FRD, Process Flow Mapping, Stakeholder Management, User Stories, UAT
DevOps, CI/CD & Infrastructure: Jenkins, GitHub Actions, GitLab CI, Docker, Terraform (Basic), Kubernetes (Basic), Version Control (Git)
Data Quality, Testing & Governance: Great Expectations, dbt Tests, PyTest, DataFold, Unit Testing, Data Validation Frameworks, Data Lineage, Data Cataloging (Purview, Collibra)
Modeling & Architecture: Star/Snowflake Schema, ER Diagrams, Dimensional Modeling, Data Vault 2.0, Data Mesh Concepts
PROJECTS
Real-Time Healthcare Analytics Platform (Azure + Databricks)
Established streaming ingestion pipeline from various clinical data sources through APIs & SQL.
Processed 5M+ records/day for operational and predictive analytics to support hospital-level KPIs. Financial Risk Data Warehouse (AWS + Snowflake + dbt)
Developed end-to-end data warehouse for risk management through AWS Glue and dbt models.
Designed semantic data layers for business dashboards in Tableau, enhancing accuracy by 40%. Customer Insights Dashboard (Power BI + SQL)
Combined sales, support, and product usage data to develop self-service business user dashboards.
Boosted report adoption by 60% across departments through simplification of visualization and metrics design.
CERTIFICATIONS
Open Source Software Development Methods
AWS Certified Developer – Associate
Machine Learning Algorithms: Supervised Learning
Python Data Structures
Cloud Computing Basics
EDUCATION
University of Maryland Baltimore County – Baltimore, MD Master of Science in Data Science Aug 2022– May 2024 Sreenidhi Institute of Science and Technology – Hyderabad, India Bachelor of Technology in Electronics and Communication Engineering Aug 2016– May 2020