DIVYA SHARMA
New Delhi, India +91-798******* *************@*****.**.**
LinkedIn: linkedin.com/in/divya-sharma-791221211 GitHub: github.com/divya2434 Professional Summary
Data Engineer with 3+ years of experience designing and building end-to-end data pipelines using Azure Data Factory, Databricks, PySpark, and SQL. Skilled in large-scale data processing, ETL/ELT workflows, and delivering analytics-ready datasets to support business decisions. Adept at optimizing workflows, ensuring data quality, and integrating BI tools. Technical Skills
Programming & Tools: Python, SQL, PySpark, Power BI, Excel, Power Query Cloud & Big Data: Azure Data Factory, Azure Databricks, ETL/ELT pipelines Visualization: Power BI, Matplotlib, Seaborn
Version Control: GitHub
Professional Experience
Data Analyst – Hinduja Global Solutions Ltd Dec 2023 – Aug 2024
Developed scalable ETL pipelines in ADF and Databricks handling high-volume sales and transaction data.
Used PySpark for data transformation, deduplication, and advanced aggregations.
Optimized pipeline performance, reducing processing time by 25%.
Created robust SQL queries to support reporting and analytics teams. Data Analyst – Arbre Creations Jul 2021 – Nov 2023
Designed reusable ADF pipelines to automate ingestion, cleansing, and transformation of structured data.
Built PySpark scripts for large-scale data joins and cleansing.
Implemented staging and production layers with version control and rollback safety.
Improved job runtime and resource utilization through Databricks optimization techniques.
MIS Executive – Tech Mahindra Sep 2020 – May 2021
Analyzed large Excel datasets and transformed them into actionable insights.
Created automated dashboards and visual reports using Power BI. MIS Executive – Atomants E-Services Jul 2015 – Aug 2020
Collected and processed PoS sales data; performed loss analytics and reporting.
Prepared reports on merchant transactions for internal and external stakeholders. Education & Certifications
Master’s in Data Science
PG Diploma, Symbiosis Centre for Distance Learning (2006 – 2010)
B.Com, Vinayaka Mission University (2002 – 2005) Portfolio Projects
Sales Transaction Pipeline: Automated ETL in PySpark + ADF + Databricks, cleaning and transforming 500k+ daily PoS records, reducing processing time by 25%.
Customer Segmentation: SQL + Python pipeline with Power BI dashboard for insights on customer behavior.
Languages
Hindi: Native
English: Proficient