AKSHIT GANGWAR
Data Engineer | Data Analyst
+91-807**-***** *****************@*****.*** linkedin.com/in/akshitgangwar
PROFESSIONAL SUMMARY
Data Engineer with 3+ years building production-grade ETL/ELT pipelines across Azure and AWS. Cut PySpark runtimes by 30% and
report refresh times from hours to minutes while hardening 30+ pipelines with data-quality checks, CI/CD, and governance. Skilled in
PySpark, SQL, Delta Lake, and dimensional modeling, turning multi-terabyte data into trusted, decision-ready analytics.
TECHNICAL SKILLS
Languages & Querying: Python (PySpark, Pandas), SQL (CTEs, Window Functions, Spark SQL), Scala
Data Engineering: ETL/ELT Pipelines, Delta Lake, Lakehouse Architecture, Star Schema & Dimensional Modeling, Data Warehousing,
Data Quality, Apache Iceberg
Azure: Databricks, Synapse Analytics, Data Factory (ADF), ADLS Gen2, Unity Catalog
AWS: Redshift, Glue, EMR, S3, Athena, Lambda
Databases & Warehouses: Snowflake, BigQuery, NoSQL
Orchestration & DevOps: Apache Airflow, Azure DevOps, Git, CI/CD, Agile/Scrum
BI & Visualization: Power BI (DAX, RLS, Power Query), Tableau, Amazon QuickSight, Excel
PROFESSIONAL EXPERIENCE
MAQ Software | Data Engineer | Sep 2025 – Present
• Cut PySpark job execution time by 30% by restructuring transformation logic, tuning Databricks clusters, and parallelizing
high-volume workloads, strengthening daily SLA adherence.
• Built end-to-end ELT pipelines in Azure Data Factory orchestrating Databricks notebooks spanning multi-terabyte Delta Lake and
Synapse workloads, establishing a governed single source of truth for downstream BI.
• Migrated 30+ production PySpark notebooks into a Git-versioned, CI/CD-deployed framework on Azure DevOps, wiring in
data-quality checks that hardened reliability and governance.
• Designed Star Schema dimensional models on Delta Lake and Azure Synapse Analytics for high-volume analytical workloads
powering enterprise reporting.
• Enforced RBAC, PII encryption, and compliance controls on every pipeline, meeting 100% of organizational security and audit
requirements.
LUMIQ (Crisp Analytics) | Data Analyst | Jan 2023 – Sep 2025
• Reduced report refresh time from 2 hours to 15 minutes (8x faster) by re-architecting Python/PySpark ETL on AWS Glue with
scheduled, automated workflows.
• Delivered enterprise Power BI dashboards with advanced DAX (MoM, YTD, variance) to 500+ users, cutting ad-hoc reporting load
and accelerating decision cycles by 40%.
• Standardized KPI definitions across business units through multi-source SQL analysis on Redshift and Athena, lifting forecasting
accuracy by 18%.
• Presented storytelling dashboards to C-suite leadership, driving organization-wide adoption of data-driven planning frameworks.
PROJECTS
Sales & Revenue Analysis Dashboard | Power BI, SQL, Python, AWS Redshift
• Modeled multi-domain datasets with Star Schema into a unified dashboard enabling regional revenue tracking, deviation analysis,
and forecasting; added Python validation scripts that enforced consistency and removed cross-unit reporting discrepancies.
Operational Efficiency Insights | Power BI, Azure Data Factory, Databricks
• Developed Delta Lake monitoring dashboards that surfaced operational bottlenecks (cutting turnaround time by 20%) and configured
parameterized, trigger-scheduled ADF pipelines that boosted end-to-end reporting efficiency by 60%.
EDUCATION
IIIT Bhubaneswar | B.Tech, Information Technology | Jul 2019 – Jun 2023
CERTIFICATIONS
Microsoft Certified: Fabric Analytics Engineer Associate (2024) | AWS Certified Developer – Associate (2024)