Data Analytics Engineer - ETL, Azure, PySpark, Power BI

Location:

Ghaziabad, Uttar Pradesh, India

Posted:

June 07, 2026

Contact this candidate

Resume:

DIVYA SHARMA

New Delhi, India +91-798******* *************@*****.**.**

LinkedIn: linkedin.com/in/divya-sharma-791221211 GitHub: github.com/divya2434 Professional Summary

and ETL workflows using Azure Data Factory, Azure Databricks, PySpark, and SQL. Experienced in processing large datasets, optimizing data pipelines, and delivering analytics- ready datasets for reporting and business insights. Skilled in data transformation, performance optimization, and supporting Power BI dashboards to enable data-driven decision making. Possess 10+ years of overall professional experience in data analysis, reporting, and business intelligence.

Technical Skills

Programming & Tools

Python, SQL, PySpark, Power BI, Excel, Power Query Cloud & Big Data

Visualization

Power BI, Matplotlib, Seaborn

Version Control

GitHub

Professional Experience

Developed PySpark transformations in Azure Databricks for large-scale data processing.

Built and maintained data ingestion pipelines ensuring reliable workflows. Sep 2024 – 4 Feb 2026

Data Analyst with 4+ years of experience in designing and building scalable Data Analyst – Innocrazy Tech Services Private Limited data pipelines

Worked on Microsoft fabric for building end-to-end data pipelines using Data Factory. and Lakehouse. I implemented medallion architecture for data processing. Azure Data Factory, Azure Databricks, ETL/ELT pipelines, Microsoft Fabric

Optimized data pipeline performance and improved processing efficiency.

Delivered analytics-ready datasets for Power BI dashboards and reporting. Data Analyst – Hinduja Global Solutions Ltd

Dec 2023 – Aug 2024

Developed scalable ETL pipelines in ADF and Databricks handling high-volume sales and transaction data.

Used PySpark for data transformation, deduplication, and aggregations.

Optimized pipeline performance reducing processing time by 25%.

Created robust SQL queries to support reporting and analytics teams. Data Analyst – Arbre Creations

Jul 2021 – Nov 2023

Designed reusable ADF pipelines to automate ingestion, cleansing, and transformation of structured data.

Built PySpark scripts for large-scale data joins and cleansing.

Implemented staging and production layers with version control and rollback safety.

Improved job runtime and resource utilization using Databricks optimization techniques.

Customer Service Associate (MIS Reporting) – Tech

Mahindra

Sep 2020 – May 2021

Analyzed Excel datasets to generate operational insights.

Created MIS reports and dashboards for internal stakeholders.

Automated daily and weekly performance reports using Excel. MIS Executive – Atomants E-Services

Jul 2015 – Aug 2020

Collected and processed sales data for reporting and analysis.

Performed sales analysis to generate business insights.

Prepared sales reports and dashboards for internal stakeholders. Education & Certifications

Master’s in Data Science

Portfolio Projects

Sales Transaction Pipeline

Automated ETL in PySpark + ADF + Databricks, cleaning and transforming 500k+ daily records, reducing processing time by 25%.

Customer Segmentation

SQL + Python pipeline with Power BI dashboard for insights on customer behavior. Languages

Hindi – Native

English – Proficient

PG Diploma –

B.Com

Contact this candidate