Post Job Free
Sign in

Senior Data Engineer - Cloud-scale ETL & Lakehouse

Location:
Reston, VA
Salary:
130000
Posted:
November 26, 2025

Contact this candidate

Resume:

Tolu Akin

Data Engineer

Dallas, Texas +* (***) - 517-3360 ***********@*****.***

Professional Summary:

Senior Data Engineer with 6+ years of experience designing, building, and optimizing large-scale data solutions across cloud and on-premises environments. Proven expertise in developing high-performance ETL pipelines, real-time streaming, and data lakehouse architectures using Apache Spark, PySpark, Databricks, Kafka, Flink, SSIS, Azure Data Factory, and dbt. Adept at orchestrating workflows with Airflow, Prefect, and Control-M, while ensuring robust data governance, lineage, and security compliance. Skilled in crafting optimized SQL Server and Snowflake data models, implementing AI/ML-powered data quality checks, and preparing curated datasets for advanced analytics and machine learning. Strong collaborator with BI and data science teams to deliver actionable insights through Power BI, Tableau, and SSRS dashboards. Certified in advanced data analytics, with a focus on scalable architecture, performance tuning, and modern data engineering best practices in multi-cloud environments (AWS, Azure, GCP).

Professional Experience:

Dell Technologies USA Oct 2024- Present

Senior Data Engineer

• Designed and implemented scalable data pipelines using PySpark on Databricks to process >5TB/day of clickstream data.

• Migrated legacy ETL jobs to Apache Spark Structured Streaming, reducing latency from 30 mins to near-real-time.

• Optimized Spark jobs using partitioning, caching, and broadcast joins, improving runtime by 40%.

• Automated data quality checks with Great Expectations and integrated monitoring with Airflow.

• Partnered with data scientists to prepare curated datasets for machine learning models.

• Design and maintain enterprise-scale data pipelines using SSIS, Azure Data Factory, and dbt to process high-volume structured and semi-structured data.

• Implement real-time streaming with Kafka and Flink for event-driven analytics and monitoring.

• Optimize SQL Server and Snowflake data models, reducing query latency by 30%.

• Automate data quality checks and anomaly detection using AI/ML models and custom Python scripts.

• Collaborate with BI teams to publish reports/dashboards in Power BI and Tableau.

• Enforce governance and security standards, including access control and lineage tracking. Matrix Point Solutions Dallas, TX Mar 2022- Sep 2024 Senior Data Engineer

• Designed and deployed 20+ Azure Data Factory pipelines, improving data refresh cycles by 35%.

• Utilized Git and Bitbucket to manage version control and collaborate on SQL and ETL development across multiple environments.

• Automated data validation processes, reducing manual QA time by 50%.

• Developed scalable ETL workflows that integrated 10+ on-prem and cloud-based sources.

• Delivered cross-departmental reporting that supported business decisions and reduced turnaround time by 40%.

• Created web jobs for real-time data processing, triggering reports and alerts based on event-driven conditions.

• Converted SAS-based data outputs into SQL Server and Power BI models, improving reporting efficiency by 30%. Intel USA Feb 2020- Mar 2022

Data Engineer

• Migrated on-premises databases (SQL Server, DB2) to Azure and AWS cloud environments.

• Developed REST/SOAP API integrations to synchronize data between SaaS platforms and enterprise warehouses.

• Built CI/CD pipelines in Azure DevOps for automated testing and deployment of data workflows.

• Scheduled and orchestrated ETL jobs using Control-M and Airflow, ensuring 24 7 availability.

• Partnered with stakeholders to improve data compliance processes and meet GDPR/HIPAA requirements. Globe Life Dallas, TX Oct 2019- Jan 2020

Senior Data Analyst/ Power BI Developer

• Developed 75+ Power BI and Tableau reports, increasing reporting efficiency by 60%.

• Optimized SSIS packages reducing ETL job failures by 80%.

• Migrated and automated 30+ Excel/Access reports to SSRS, streamlining delivery cycles.

• Created T-SQL stored procedures and triggers that reduced query execution times by 40%. Educations and Certifications:

• Master’s in Advanced Data Analyst University of North Texas

• Bachelor’s in Business Administration University of Lagos Technical Skills:

• Data Integration & ETL / Pipelines

SSIS, Azure Data Factory, Dell Boomi

Modern pipeline tools: dbt, Prefect, Apache Airflow

• Programming & Scripting

SQL, T-SQL, PL/SQL, Python, C#, JavaScript, HTML/CSS REST & SOAP APIs

Prompt engineering / AI-assisted code generation

• Big Data & Processing:

Apache Spark, PySpark, Spark SQL, Spark Streaming, Structured Streaming, Delta Lake

• Databases & Storage

Microsoft SQL Server (2000-2022), DB2, MS Access

NoSQL / distributed stores (e.g. MongoDB, Cassandra) Data Lake / Lakehouse platforms: Snowflake, Databricks, BigQuery

• Streaming & Real-Time Processing

Kafka, Flink, Pulsar

Cloud & Infrastructure

Cloud Platforms: AWS, Azure, GCP

Serverless computing (AWS Lambda, Azure Functions etc.) Multi-cloud / hybrid architectures

• Business Intelligence & Reporting

Power BI, Tableau, SSRS, Telerik

• AI / ML & Data Quality

Integration of ML pipelines

AI-assisted data monitoring, data anomaly detection

• Governance, Security & Compliance

Data lineage, access control, privacy regulations & policies

• Version Control & Scheduling / DevOps Practices

Azure DevOps, TFS, Control-M

CI/CD for data workflows

• Tools & Productivity

Visual Studio, JIRA, Excel (Advanced)



Contact this candidate