
Data Engineer Spark, Snowflake, Azure, BI Specialist

Location:
Denton, TX
Salary:
65000
Posted:
November 12, 2025


Resume:

Gowtam Reddy Gurrala

DATA ENGINEER

Baltimore, MD, *************@*****.***, 443-***-****, LinkedIn

PROFESSIONAL EXPERIENCE

McKesson MD, USA

Data Engineer Apr 2025 - Current

Implemented and deployed data pipelines with PySpark, Databricks, and Azure Data Factory to process 3+ TB of daily healthcare data.

Combined data from APIs, SQL Server, and Snowflake into cohesive datasets for business analysis and executive reporting.

Collaborated with business stakeholders to establish KPIs, facilitating data-driven business decisions for 8+ business units.

Delivered Delta Lake architecture and Power BI designs, enhancing dashboard performance by 45%.

Automated data validation with Great Expectations, achieving 99% data reliability for downstream reports.

Coordinated with finance and operations departments to convert analytical findings into cost reductions (~$250K per year).

Tools: Azure (ADF, Synapse, Databricks), PySpark, SQL, Power BI, Python, Delta Lake, Snowflake

JPMorganChase MD, USA

Data Engineer / Programmer Analyst Sep 2024 - Apr 2025

Developed and operated ETL pipelines in AWS Glue and Airflow, streamlining data workflows in 5 departments.

Created Snowflake data models and dbt transforms for credit risk analysis and regulatory reporting.

Automated data reconciliation with Python and SQL, reducing manual QA time by 70%.

Worked with business analysts, data scientists, and product teams to define and deliver critical datasets for predictive analytics.

Deployed Jenkins-based CI/CD pipelines for ETL code, enhancing release efficiency by 30%.

Collaborated with business teams to set up metrics tracking for financial transactions (1M+ records per day).

Tools: AWS (Glue, S3, Redshift, Lambda), Airflow, dbt, Snowflake, Python, SQL, Jenkins, Tableau

Trigent Hyderabad, India

Business Systems Analyst May 2020 - June 2022

Served as a bridge between business and technical teams to gather requirements for analytics and data warehouse solutions.

Performed data mapping, profiling, and validation for migration projects on SQL Server and Power BI.

Automated KPI tracking reports for customers, enhancing stakeholder visibility by 35%.

Created process documentation, user stories, and UAT scripts through Jira and Confluence.

Reviewed business processes to identify where they could be automated using Excel macros and Python scripting.

Tools: SQL Server, Power BI, Jira, Confluence, Excel, Python

Adani Hyderabad, India

Junior Business Analyst Mar 2019 - Apr 2020

Collected business requirements and translated them into data models and visual dashboards.

Supported operations, logistics, and energy analytics with SQL and Tableau.

Created automated Excel reports and conducted root cause analysis on monthly variance trends.

Collaborated with technical teams to map database schemas to business KPIs.

Tools: SQL, Excel, Tableau, Python (Basic)

TECHNICAL SKILLS

Programming & Scripting: Python (Pandas, NumPy, PySpark, Scikit-learn, SQLAlchemy), SQL (T-SQL, PL/SQL), Scala, R (Basic), Shell Scripting, Bash

Data Engineering & ETL: Databricks, Apache Spark, Delta Lake, Airflow, dbt, Azure Data Factory (ADF), AWS Glue, Kafka, Snowflake, Synapse Analytics, Informatica, REST APIs, Batch & Streaming Pipelines, CDC (Change Data Capture)

Cloud Platforms: Azure (Synapse, Data Lake, ADF, Databricks), AWS (S3, Redshift, Lambda, Glue), GCP (BigQuery, Cloud Composer)

Databases: Snowflake, SQL Server, PostgreSQL, MySQL, Oracle, MongoDB (Basic), DynamoDB (Basic)

Data Visualization & Analytics: Power BI (DAX, Power Query), Tableau, Looker, Excel (Pivot, Power Query), Google Data Studio

Business & Process Tools: Jira, Confluence, Lucidchart, Visio, Agile/Scrum, BRD/FRD, Process Flow Mapping, Stakeholder Management, User Stories, UAT

DevOps, CI/CD & Infrastructure: Jenkins, GitHub Actions, GitLab CI, Docker, Terraform (Basic), Kubernetes (Basic), Version Control (Git)

Data Quality, Testing & Governance: Great Expectations, dbt Tests, PyTest, DataFold, Unit Testing, Data Validation Frameworks, Data Lineage, Data Cataloging (Purview, Collibra)

Modeling & Architecture: Star/Snowflake Schema, ER Diagrams, Dimensional Modeling, Data Vault 2.0, Data Mesh Concepts

PROJECTS

Real-Time Healthcare Analytics Platform (Azure + Databricks)

Established streaming ingestion pipeline from various clinical data sources through APIs & SQL.

Processed 5M+ records/day for operational and predictive analytics to support hospital-level KPIs.

Financial Risk Data Warehouse (AWS + Snowflake + dbt)

Developed end-to-end data warehouse for risk management through AWS Glue and dbt models.

Designed semantic data layers for business dashboards in Tableau, enhancing accuracy by 40%.

Customer Insights Dashboard (Power BI + SQL)

Combined sales, support, and product usage data to develop self-service business user dashboards.

Boosted report adoption by 60% across departments through simplification of visualization and metrics design.

CERTIFICATIONS

Open Source Software Development Methods

AWS Certified Developer – Associate

Machine Learning Algorithms: Supervised Learning

Python Data Structures

Cloud Computing Basics

EDUCATION

University of Maryland Baltimore County – Baltimore, MD

Master of Science in Data Science Aug 2022 – May 2024

Sreenidhi Institute of Science and Technology – Hyderabad, India

Bachelor of Technology in Electronics and Communication Engineering Aug 2016 – May 2020


