DIVYA SHARMA
New Delhi, India +91-798******* *************@*****.**.**
LinkedIn: linkedin.com/in/divya-sharma-791221211 GitHub: github.com/divya2434
Professional Summary
Data Analyst with 4+ years of experience in designing and building scalable data pipelines and ETL workflows using Azure Data Factory, Azure Databricks, PySpark, and SQL. Experienced in processing large datasets, optimizing data pipelines, and delivering analytics-ready datasets for reporting and business insights. Skilled in data transformation, performance optimization, and supporting Power BI dashboards to enable data-driven decision making. Possess 10+ years of overall professional experience in data analysis, reporting, and business intelligence.
Technical Skills
Programming & Tools
Python, SQL, PySpark, Power BI, Excel, Power Query
Cloud & Big Data
Azure Data Factory, Azure Databricks, ETL/ELT pipelines, Microsoft Fabric
Visualization
Power BI, Matplotlib, Seaborn
Version Control
GitHub
Professional Experience
Data Analyst – Innocrazy Tech Services Private Limited
Sep 2024 – 4 Feb 2026
Developed PySpark transformations in Azure Databricks for large-scale data processing.
Built and maintained data ingestion pipelines ensuring reliable workflows.
Built end-to-end data pipelines in Microsoft Fabric using Data Factory and Lakehouse, and implemented a medallion architecture for data processing.
Optimized data pipeline performance and improved processing efficiency.
Delivered analytics-ready datasets for Power BI dashboards and reporting.
Data Analyst – Hinduja Global Solutions Ltd
Dec 2023 – Aug 2024
Developed scalable ETL pipelines in ADF and Databricks handling high-volume sales and transaction data.
Used PySpark for data transformation, deduplication, and aggregations.
Optimized pipeline performance reducing processing time by 25%.
Created robust SQL queries to support reporting and analytics teams.
Data Analyst – Arbre Creations
Jul 2021 – Nov 2023
Designed reusable ADF pipelines to automate ingestion, cleansing, and transformation of structured data.
Built PySpark scripts for large-scale data joins and cleansing.
Implemented staging and production layers with version control and rollback safety.
Improved job runtime and resource utilization using Databricks optimization techniques.
Customer Service Associate (MIS Reporting) – Tech Mahindra
Sep 2020 – May 2021
Analyzed Excel datasets to generate operational insights.
Created MIS reports and dashboards for internal stakeholders.
Automated daily and weekly performance reports using Excel.
MIS Executive – Atomants E-Services
Jul 2015 – Aug 2020
Collected and processed sales data for reporting and analysis.
Performed sales analysis to generate business insights.
Prepared sales reports and dashboards for internal stakeholders.
Education & Certifications
Master’s in Data Science
PG Diploma –
B.Com
Portfolio Projects
Sales Transaction Pipeline
Automated ETL in PySpark + ADF + Databricks, cleaning and transforming 500k+ daily records, reducing processing time by 25%.
Customer Segmentation
SQL + Python pipeline with Power BI dashboard for insights on customer behavior.
Languages
Hindi – Native
English – Proficient