Post Job Free
Sign in

Data Engineer Azure

Location:
St. Louis, MO
Posted:
March 25, 2025

Contact this candidate

Resume:

Hemalatha Yalamanchi

Azure Data Engineer

+1-314-***-**** *********************@*****.***

www.linkedin.com/in/hemalatha-yalamanchi-2b7621353

PROFESSIONAL SUMMARY

Results-driven Azure Data Engineer with 8+ years of experience in designing, developing, and optimizing cloud-based data solutions. Skilled in implementing end-to-end data pipelines using Azure Data Factory, Azure Synapse, Databricks, and Data Lakes. Expertise in ETL processes, big data frameworks, SQL development, and data governance. Proven ability to build scalable, high-performance cloud solutions to support enterprise analytics and business intelligence. Strong background in the healthcare, banking, and retail sectors, ensuring compliance with HIPAA, GDPR, and CCPA regulations.

CORE COMPETENCIES

Azure Cloud Services: Azure Data Factory (ADF), Azure Synapse, Azure Databricks, Azure Data Lake, Azure SQL, Azure Blob Storage, Event Hubs, Logic Apps, Azure Functions

ETL & Data Processing: Data ingestion, transformation, and curation using ADF, Databricks (PySpark), and SQL-based frameworks

Big Data & Analytics: Apache Spark, Delta Lake, Hadoop, Hive, Power BI, Data Modeling

Programming & Scripting: Python, PySpark, SQL (T-SQL, PL/SQL), Shell Scripting, Scala

Data Warehousing: Azure Synapse Analytics, Snowflake, SQL Server, Redshift, BigQuery

Data Governance & Security: Metadata management, data lineage, RBAC, encryption, compliance (HIPAA, GDPR)

DevOps & Automation: Terraform, Azure DevOps (CI/CD), GitHub, Docker, Kubernetes

PROFESSIONAL EXPERIENCE

Truist Bank, Atlanta, GA

Senior Azure Data Engineer • Feb 2023 – Present

Designed and developed scalable Azure Data Factory (ADF) pipelines to ingest and transform large volumes of banking data from various sources (On-Prem, APIs, Cloud).

Developed ETL processes in Azure Databricks using PySpark, reducing data transformation time by 40%.

Optimized Azure Synapse and Data Lake Storage to enhance performance and cost efficiency for enterprise analytics.

Built data security frameworks using Azure Key Vault, RBAC, and encryption techniques to comply with banking regulations.

Developed and deployed CI/CD pipelines in Azure DevOps to automate data pipeline deployment and minimize downtime.

Created Power BI dashboards for real-time financial reporting, improving executive decision-making.

Johnson & Johnson, New Brunswick, NJ

Azure Data Engineer • Oct 2021 – Jan 2023

Built and optimized Azure-based data pipelines for clinical trial and pharmaceutical data, ensuring HIPAA compliance.

Migrated on-prem data warehouses to Azure Synapse, improving query performance by 30%.

Developed incremental data ingestion strategies in Azure Data Lake to handle structured and unstructured datasets.

Implemented Data Governance standards and integrated Azure Purview for metadata management.

Automated data quality monitoring with Databricks and Azure Monitor, reducing data discrepancies by 35%.

Led a data lineage tracking initiative using Azure Data Catalog to improve transparency across teams.

Aditya Birla Retail, Mumbai, India

Azure Data Engineer • Apr 2018 – Jul 2021

Designed and developed Azure-based ETL pipelines to support large-scale retail analytics and customer insights.

Migrated data from SQL Server to Azure Synapse, ensuring efficient data partitioning and indexing strategies for improved performance.

Implemented Azure Logic Apps and Event Hubs for streaming real-time sales data from POS systems to Azure Data Lake.

Developed interactive Power BI dashboards to provide visibility into supply chain and inventory trends.

Optimized data workflows in Azure Data Factory (ADF) by implementing Data Flow Activities and parameterized pipelines.

Bytridge, India

Data Engineer • Jun 2016 – Mar 2019

Designed data pipelines using ADF and SQL Server to support enterprise reporting needs.

Implemented data lake storage architecture to centralize transactional and analytical datasets.

Developed Python scripts to automate data ingestion and transformation in Azure Databricks.

Ensured data security and access controls using Azure Active Directory (AAD) and Managed Identities.

Created Power BI reports to track key business performance metrics.

EDUCATION

Bachelor of Technology in Computer Science

Lovely Professional University (LPU), Punjab, India – May 2016

KEY PROJECTS & IMPACT

Enterprise Azure Data Lake Migration

Led migration of 10TB+ of structured and unstructured data from on-prem databases to Azure Data Lake.

Improved query performance by 50% through data partitioning and indexing strategies in Azure Synapse.

Real-time Banking Data Streaming

Developed real-time data ingestion pipeline using Azure Event Hubs and Databricks, reducing data latency from 30 minutes to 5 seconds.

Ensured data security and compliance using Azure Key Vault and Managed Identities.

Retail Customer Insights & Analytics

Designed and deployed customer segmentation models using Azure Machine Learning and Databricks, improving targeted marketing campaigns.

Built Power BI dashboards for real-time sales forecasting and inventory optimization.

TOOLS & TECHNOLOGIES

Azure Cloud: Azure Data Factory, Azure Synapse, Azure Databricks, Azure Data Lake, Azure SQL, Blob Storage, Event Hubs, Logic Apps

Big Data: Apache Spark, Hadoop, Delta Lake, Hive, Kafka

ETL & Data Processing: ADF, Databricks (PySpark), SQL, Informatica, SSIS

Programming: Python, PySpark, SQL (T-SQL, PL/SQL), Shell Scripting

DevOps & Automation: Azure DevOps, Terraform, GitHub Actions, Docker, Kubernetes

BI & Reporting: Power BI, Tableau, Looker Studio



Contact this candidate