
Data Engineer Power Bi

Location:
Plainfield, IL
Posted:
May 21, 2025


Resume:

RAHUL TEJA BOLLOJU

• Chicago, IL • *.***********@*****.*** • +1-906-***-**** • Linkedin • Portfolio • Open to Relocate

Highly determined Data Engineer with 4+ years of experience managing petabytes of data. Experienced in building scalable data lakehouses, 200+ batch and streaming ETL pipelines, and big data infrastructure to enhance data accessibility and reliability.

SKILLS:

●Programming Languages: Python (PySpark, Pandas, NumPy), SQL (T-SQL, PL/SQL, SparkSQL), Scala

●Big Data Technologies: Spark, Airflow, Kafka, HDFS, Hive, Presto, Hadoop

●ETL & Visualization: Airflow, Glue, Azure Data Factory, Dagster, dbt, Databricks, Fivetran, Power BI, Tableau, QuickSight

●Databases & Storage: Snowflake, PostgreSQL, Redshift, ADLS, Azure Synapse, S3, MySQL, MSSQL, MongoDB (NoSQL)

●Other Tools: Git, Agile, JIRA, Docker, VS Code, Kubernetes, CI/CD Pipelines, MLflow

EXPERIENCE:

Data Engineer Jun 2024 - Present

T-Mobile Seattle, WA

●Spearheaded the architecture of a centralized Azure Data Lake, enabling cross-functional business teams to access next-day operational insights and accelerating analytics delivery by 40%.

●Engineered secure and scalable ETL pipelines with Azure Data Factory and Databricks to ingest and transform enterprise-scale datasets from SQL and on-prem systems, reducing data refresh latency by 35%.

●Designed a distributed data model in Azure Synapse Analytics using intelligent partitioning and distribution techniques, improving dashboard query performance by 3x.

●Delivered executive-facing Power BI dashboards visualizing weekly product KPIs, directly informing performance tracking and strategic decisions across retail, online, and B2B channels.

●Led migration of Power BI workspaces from Premium (P SKU) to Microsoft Fabric (F SKU), cutting infrastructure costs while enhancing governance, workspace scalability, and refresh reliability.

Data Engineer Sep 2023 - May 2024

Michigan Technological University Houghton, MI

●Processed 10+ million time-series records, building batch data pipelines with BigQuery and streaming pipelines with Dataflow (Apache Beam). Used Airflow for orchestration and Cloud Composer for deployment.

●Designed and operationalized custom ETL pipelines for ad hoc requests from different departments using SSIS for MSSQL, Oracle Data Integrator (ODI), and Teradata, ensuring seamless data flow for specific analytical demands.

●Improved database efficiency, memory utilization, and system functionality by 15%.

Data Science Intern Jul 2023 - Aug 2023

Pitney Bowes Stamford, CT

●Predicted defaulters using an XGBoost model, achieving an AUC of 0.76 to enhance high-risk customer outreach strategies.

●Orchestrated the Power BI dashboard, improving debt visualization based on stakeholder input for actionable KPIs.

●Unified and analyzed multi-source data from Snowflake, processing 12 million records to optimize debt collection and improve agent productivity.

●Led analytical initiatives for Pitney Bowes' top 200 clients, refining ARIMA and Prophet models for time series forecasting.

●Reduced MAPE by 15% for 60% of clients using client-specific logic, enhancing model precision with advanced cross-validation.

Data Engineer Jan 2020 - Jul 2022

FonkR Solutions Hyderabad, TG

●Lake-house foundation on AWS: Modeled 30+ fact/dim tables across S3 (Bronze/Silver) + Redshift (Gold); automated 25+ pipelines with AWS Glue Studio + Fivetran, slashing month-end close 30% and retiring legacy SQL Server to save $25 K/yr.

●Real-time ingestion at scale: piped 50K msg/s through Amazon MSK (Kafka) → Kinesis Data Firehose → Redshift, delivering exactly-once semantics and 25% lower dashboard latency; MSK tiered storage cut streaming cost 15%.

●BI modernization: migrated 70+ Tableau workbooks to Amazon QuickSight + Athena over partitioned Parquet on S3, eliminating 38% data redundancy and reducing BI licensing by $18 K/yr.

●CRM event automation: integrated Zoho CRM via API Gateway, Lambda & Step Functions, streaming lead-touch events to Redshift; improved campaign-performance visibility by 24%, enabling targeted spend optimization.

EDUCATION:

Michigan Technological University

Master of Science in Data Science Houghton, MI


