Data Engineer

Location:

Delhi, ON, Canada

Posted:

June 05, 2026

Contact this candidate

Original resume on Jobvertise

Resume:

AKSHIT GANGWAR

Data Engineer Data Analyst

+91-807**-***** *****************@*****.*** linkedin.com/in/akshitgangwar

PROFESSIONAL SUMMARY

Data Engineer with 3+ years building production-grade ETL/ELT pipelines across Azure and AWS. Cut PySpark runtimes by 30% and

report refresh times from hours to minutes while hardening 30+ pipelines with data-quality checks, CI/CD, and governance. Skilled in

PySpark, SQL, Delta Lake, and dimensional modeling, turning multi-terabyte data into trusted, decision-ready analytics.

TECHNICAL SKILLS

Languages & Querying: Python (PySpark, Pandas), SQL (CTEs, Window Functions, Spark SQL), Scala

Data Engineering: ETL/ELT Pipelines, Delta Lake, Lakehouse Architecture, Star Schema & Dimensional Modeling, Data Warehousing,

Data Quality, Apache Iceberg

Azure: Databricks, Synapse Analytics, Data Factory (ADF), ADLS Gen2, Unity Catalog

AWS: Redshift, Glue, EMR, S3, Athena, Lambda

Databases & Warehouses: Snowflake, BigQuery, NoSQL

Orchestration & DevOps: Apache Airflow, Azure DevOps, Git, CI/CD, Agile/Scrum

BI & Visualization: Power BI (DAX, RLS, Power Query), Tableau, Amazon QuickSight, Excel

PROFESSIONAL EXPERIENCE

MAQ Software ? Data Engineer Sep 2025 ? Present

? Cut PySpark job execution time by 30% by restructuring transformation logic, tuning Databricks clusters, and parallelizing high-

volume workloads ? strengthening daily SLA adherence.

? Built end-to-end ELT pipelines in Azure Data Factory orchestrating Databricks notebooks spanning multi-terabyte Delta Lake and

Synapse workloads, establishing a governed single source of truth for downstream BI.

? Migrated 30+ production PySpark notebooks into a Git-versioned, CI/CD-deployed framework on Azure DevOps, wiring in data-

quality checks that hardened reliability and governance.

? Designed Star Schema dimensional models on Delta Lake and Azure Synapse Analytics for high-volume analytical workloads

powering enterprise reporting.

? Enforced RBAC, PII encryption, and compliance controls on every pipeline, meeting 100% of organizational security and audit

requirements.

LUMIQ (Crisp Analytics) ? Data Analyst Jan 2023 ? Sep 2025

? Reduced report refresh time from 2 hours to 15 minutes (8x faster) by re-architecting Python/PySpark ETL on AWS Glue with

scheduled, automated workflows.

? Delivered enterprise Power BI dashboards with advanced DAX (MoM, YTD, variance) to 500+ users, cutting ad-hoc reporting load

and accelerating decision cycles by 40%.

? Standardized KPI definitions across business units through multi-source SQL analysis on Redshift and Athena, lifting forecasting

accuracy by 18%.

? Presented storytelling dashboards to C-suite leadership, driving organization-wide adoption of data-driven planning frameworks.

PROJECTS

Sales & Revenue Analysis Dashboard ? Power BI, SQL, Python, AWS Redshift

? Modeled multi-domain datasets with Star Schema into a unified dashboard enabling regional revenue tracking, deviation analysis,

and forecasting; added Python validation scripts that enforced consistency and removed cross-unit reporting discrepancies.

Operational Efficiency Insights ? Power BI, Azure Data Factory, Databricks

? Developed Delta Lake monitoring dashboards that surfaced operational bottlenecks (cut turnaround time 20%) and configured

parameterized, trigger-scheduled ADF pipelines that boosted end-to-end reporting efficiency 60%.

EDUCATION

IIIT Bhubaneswar ? B.Tech, Information Technology Jul 2019 ? Jun 2023

CERTIFICATIONS

Microsoft Certified: Fabric Analytics Engineer Associate (2024) ? AWS Certified Developer ? Associate (2024)

Contact this candidate