Tolu Akin
Data Engineer
Dallas, Texas +* (***) - 517-3360 ***********@*****.***
Professional Summary:
Senior Data Engineer with 6+ years of experience designing, building, and optimizing large-scale data solutions across cloud and on-premises environments. Proven expertise in developing high-performance ETL pipelines, real-time streaming, and data lakehouse architectures using Apache Spark, PySpark, Databricks, Kafka, Flink, SSIS, Azure Data Factory, and dbt. Adept at orchestrating workflows with Airflow, Prefect, and Control-M, while ensuring robust data governance, lineage, and security compliance. Skilled in crafting optimized SQL Server and Snowflake data models, implementing AI/ML-powered data quality checks, and preparing curated datasets for advanced analytics and machine learning. Strong collaborator with BI and data science teams to deliver actionable insights through Power BI, Tableau, and SSRS dashboards. Certified in advanced data analytics, with a focus on scalable architecture, performance tuning, and modern data engineering best practices in multi-cloud environments (AWS, Azure, GCP).
Professional Experience:
Dell Technologies USA Oct 2024- Present
Senior Data Engineer
• Designed and implemented scalable data pipelines using PySpark on Databricks to process >5TB/day of clickstream data.
• Migrated legacy ETL jobs to Apache Spark Structured Streaming, reducing latency from 30 mins to near-real-time.
• Optimized Spark jobs using partitioning, caching, and broadcast joins, improving runtime by 40%.
• Automated data quality checks with Great Expectations and integrated monitoring with Airflow.
• Partnered with data scientists to prepare curated datasets for machine learning models.
• Design and maintain enterprise-scale data pipelines using SSIS, Azure Data Factory, and dbt to process high-volume structured and semi-structured data.
• Implement real-time streaming with Kafka and Flink for event-driven analytics and monitoring.
• Optimize SQL Server and Snowflake data models, reducing query latency by 30%.
• Automate data quality checks and anomaly detection using AI/ML models and custom Python scripts.
• Collaborate with BI teams to publish reports/dashboards in Power BI and Tableau.
• Enforce governance and security standards, including access control and lineage tracking.
Matrix Point Solutions Dallas, TX Mar 2022- Sep 2024
Senior Data Engineer
• Designed and deployed 20+ Azure Data Factory pipelines, improving data refresh cycles by 35%.
• Utilized Git and Bitbucket to manage version control and collaborate on SQL and ETL development across multiple environments.
• Automated data validation processes, reducing manual QA time by 50%.
• Developed scalable ETL workflows that integrated 10+ on-prem and cloud-based sources.
• Delivered cross-departmental reporting that supported business decisions and reduced turnaround time by 40%.
• Created web jobs for real-time data processing, triggering reports and alerts based on event-driven conditions.
• Converted SAS-based data outputs into SQL Server and Power BI models, improving reporting efficiency by 30%.
Intel USA Feb 2020- Mar 2022
Data Engineer
• Migrated on-premises databases (SQL Server, DB2) to Azure and AWS cloud environments.
• Developed REST/SOAP API integrations to synchronize data between SaaS platforms and enterprise warehouses.
• Built CI/CD pipelines in Azure DevOps for automated testing and deployment of data workflows.
• Scheduled and orchestrated ETL jobs using Control-M and Airflow, ensuring 24/7 availability.
• Partnered with stakeholders to improve data compliance processes and meet GDPR/HIPAA requirements.
Globe Life Dallas, TX Oct 2019- Jan 2020
Senior Data Analyst/ Power BI Developer
• Developed 75+ Power BI and Tableau reports, increasing reporting efficiency by 60%.
• Optimized SSIS packages, reducing ETL job failures by 80%.
• Migrated and automated 30+ Excel/Access reports to SSRS, streamlining delivery cycles.
• Created T-SQL stored procedures and triggers that reduced query execution times by 40%.
Education and Certifications:
• Master’s in Advanced Data Analytics University of North Texas
• Bachelor’s in Business Administration University of Lagos
Technical Skills:
• Data Integration & ETL / Pipelines
SSIS, Azure Data Factory, Dell Boomi
Modern pipeline tools: dbt, Prefect, Apache Airflow
• Programming & Scripting
SQL, T-SQL, PL/SQL, Python, C#, JavaScript, HTML/CSS, REST & SOAP APIs
Prompt engineering / AI-assisted code generation
• Big Data & Processing
Apache Spark, PySpark, Spark SQL, Spark Streaming, Structured Streaming, Delta Lake
• Databases & Storage
Microsoft SQL Server (2000-2022), DB2, MS Access
NoSQL / distributed stores (e.g. MongoDB, Cassandra)
Data Lake / Lakehouse platforms: Snowflake, Databricks, BigQuery
• Streaming & Real-Time Processing
Kafka, Flink, Pulsar
• Cloud & Infrastructure
Cloud Platforms: AWS, Azure, GCP
Serverless computing (AWS Lambda, Azure Functions, etc.)
Multi-cloud / hybrid architectures
• Business Intelligence & Reporting
Power BI, Tableau, SSRS, Telerik
• AI / ML & Data Quality
Integration of ML pipelines
AI-assisted data monitoring, data anomaly detection
• Governance, Security & Compliance
Data lineage, access control, privacy regulations & policies
• Version Control & Scheduling / DevOps Practices
Azure DevOps, TFS, Control-M
CI/CD for data workflows
• Tools & Productivity
Visual Studio, JIRA, Excel (Advanced)