Post Job Free
Sign in

Data Engineer Senior

Location:
Dallas, TX
Salary:
110K
Posted:
September 10, 2025

Contact this candidate

Resume:

ABDUL SALAM

Dallas, Texas +1-913-***-**** *******************@*****.***

SUMMARY

Senior Data Engineer with 6 years of cloud experience delivering robust, scalable data platforms. Proven ability to design and optimize performance-driven ETL pipelines using Apache Spark for both streaming and batch workloads. Skilled in implementing Change Data Capture processes and managing data lakes, while ensuring data integrity and usability for analytics. SKILLS

• Cloud & Data Platforms: Azure (ADF, ADLS, Synapse), Azure Databricks, AWS (S3, Redshift, Lambda), AWS Skillset

• Databases & SQL: MS SQL Server, PostgreSQL, T-SQL, Query Optimization, Indexing

• Data Engineering: ETL/ELT Pipelines, Data Integration, Data Lineage, Data Governance, Change Data Capture

• Big Data & Distributed Systems: Apache Spark (Databricks), Delta Lake, Hive, Apache Griffin, Apache Hudi

• BI & Visualization: Power BI, Grafana, Tableau

• Programming: Python, PowerShell, PySpark, Object-Oriented Design, Java, Scala

• DevOps & Automation: Git, Bitbucket, CI/CD, Airflow, Monitoring, Agile Practices

• Domain Focus: Healthcare Data, Claims Processing, ML Workflows, Compliance WORK EXPERIENCE

LTIMindtree Dec 2021 - Dec 2023

Senior Data Engineer Hyderabad

• Designed and deployed high-throughput data pipelines using Azure Data Factory and Azure Databricks, transforming terabytes of healthcare and transactional data.

• Developed and optimized scalable Apache Spark data pipelines using Databricks Data Frames for both streaming and batch processing, enhancing performance and cost-efficiency.

• Tuned and managed large-scale MS SQL Server workloads; wrote efficient queries, indexes, and partitioning strategies.

• Delivered secure, dynamic dashboards in Power BI to meet compliance and executive reporting demands.

• Integrated Delta Lake and implemented data versioning to improve traceability across the data lifecycle.

• Automated monitoring and alerting pipelines in Azure to reduce downtime and enhance visibility into data quality.

• Collaborated with data architects, compliance officers, and DevOps teams to align engineering efforts with regulatory and security standards.

Concentrix Feb 2019 - Dec 2021

Data Engineer Hyderabad

• Engineered ETL pipelines on AWS (S3, Redshift, Lambda) incorporating Change Data Capture techniques to consolidate business and healthcare data from multiple global systems.

• Developed automated reporting tools using Python and SQL, supporting internal analytics and third-party integrations.

• Optimized performance of cloud-native queries in Redshift, reducing query time by over 40% for high-volume datasets.

• Supported early-stage machine learning pipelines, integrating model predictions into dashboards and workflows.

• Implemented CI/CD for data processing and collaborated with ML teams on experiment tracking and inference APIs.

• Conducted training sessions for cross-functional teams on AWS services and fundamentals of ML model integration. EDUCATION

University of Central Missouri 2025

Masters, Data Science & AI

Jawaharlal Nehru Technological University 2018

Bachelors, Computer Science

ACADEMIC PROJECT

AI-Driven Predictive Analytics for Healthcare Claims University of Central Missouri

• Built machine learning models using Python (scikit-learn, pandas) to predict high-risk and potentially fraudulent insurance claims.

• Integrated and processed large-scale healthcare datasets in a cloud environment (AWS EC2 + S3).

• Developed feature engineering workflows and conducted model evaluation (ROC-AUC, precision-recall).

• Visualized patterns and fraud risk using Power BI, supporting interpretability for non-technical stakeholders.

• Implemented automated retraining and versioning scripts to ensure model lifecycle consistency. CERTIFICATIONS

• Microsoft Certified: Azure Fundamentals

• Databricks Lakehouse Fundamentals:in progress



Contact this candidate