Post Job Free
Sign in

Data Engineer Senior

Location:
Ashburn, VA
Posted:
May 10, 2025

Contact this candidate

Resume:

Selvarajan J. Mudliar

Email: ************@*****.*** Residence: Ashburn, Virginia

Mobile: 859-***-**** Clearance: Active Secret

Professional Experience

Senior Data Engineer with over 10 years of expertise in data engineering, specializing in advanced big data and cloud solutions. Leveraging skills in PySpark, Databricks and Hadoop, excels in optimizing data processing pipelines and ETL processes, ensuring seamless data transformation and integration. A visionary leader committed to driving cross-agency data standardization and pioneering cloud migration strategies to enhance data accuracy and operational efficiency.

Skills

Databricks 13.3. Pyspark 3.2.1, Python 3, StreamSets Control Hub 3.50.1, Unix Script, Informatica Power Center 8.1.1, Pentaho 5.0.6, Big Data ((Hadoop 1.1.0-cdh5.5.2, Spark 1.6.0, Hive 2.0, Sqoop-1.4.6), Informix 4gl,Oracle,AWS (S3/Redshift) Employment History

Senior Data Engineer May 2019 - Present

Booz Allen Hamilton Inc.

• Implemented data processing pipelines using Databricks (PySpark) for FEVS 2024.Engineered data processing pipelines using PySpark for FEVS 2024, ensuring robust data transformation and quality assurance for large-scale federal surveys

• Executed precise field mapping between OPM and DOD databases, establishing standardized data integration protocols while maintaining data integrity

• Developed automated debugging workflows for PySpark logic, streamlining code verification processes and reducing troubleshooting time

• Optimized data transformation workflows, leading to improved processing efficiency and enhanced data accuracy across federal systems

• Spearheaded cross-agency data standardization initiatives, facilitating seamless information exchange between OPM and DOD platforms

• Architected scalable data processing frameworks using PySpark, enabling efficient handling of complex federal survey data while ensuring system reliability

• Orchestrated comprehensive field mapping strategies between federal databases, establishing robust data synchronization while maintaining strict compliance standards

• Modernized legacy data integration processes, delivering enhanced system interoperability and streamlined cross-agency data exchange capabilities

• Pioneered advanced automation solutions for data quality assurance, enabling consistent validation across large-scale federal information systems

• Designed and implemented machine learning algorithms for automated data quality validation, enhancing accuracy in cross-agency data exchanges

• Refined PySpark data processing architecture for federal surveys, improving system performance and reducing processing bottlenecks

• Established comprehensive data governance protocols for OPM-DOD field mappings, ensuring consistent data integrity and compliance

• Engineered sophisticated field mapping algorithms between OPM and DOD databases, ensuring seamless data synchronization while maintaining strict compliance protocols

• Streamlined debugging workflows for complex PySpark operations, reducing system troubleshooting time and enhancing overall pipeline reliability

• Implemented comprehensive data governance frameworks for cross-agency information exchange, strengthening data integrity and standardization practices Databricks v13,3 on AWS. (Using Pyspark in Spark 3.2.1), StreamSets Control Hub 3.50.1, Unix shell script

ETL Engineer Big Data May 2014 – April 2019

ManTech International Corporation.

• Creating/executing catchup jobs for AWS from current system. Engineered data transformation workflows across multiple MIDAS modules using Pentaho 5.0.6, ensuring seamless integration and enhanced data quality for business operations

• Implemented Spark framework solutions for automated XML parsing and Hadoop ingestion, streamlining data processing workflows and reducing manual intervention

• Orchestrated comprehensive AWS migration strategy, successfully transitioning from Oracle to Redshift while maintaining data integrity and system performance

• Developed and optimized Hive queries for complex data analysis, enabling efficient data exploration and actionable insights for stakeholders

• Spearheaded parallel job execution validation between legacy and AWS environments, ensuring consistent data accuracy while maintaining production schedules

• Engineered complex data transformation pipelines across MIDAS modules using Pentaho 5.0.6, enhancing data quality and streamlining cross-module integration

• Developed and refined Hive queries for advanced data analysis, delivering actionable insights and supporting data-driven decision making

• Orchestrated parallel environment testing between legacy and AWS systems, establishing robust validation protocols and ensuring data consistency

• Engineered complex data pipelines across MIDAS modules using Pentaho 5.0.6, optimizing data transformation workflows and enhancing system integration quality

• Architected automated XML parsing solutions using Spark framework, streamlining data ingestion processes and reducing manual processing overhead

• Led migration from Oracle to AWS Redshift, implementing strategic data transfer protocols while maintaining system performance and data integrity Programmer Analyst-Lead May 2011 – Feb 2014

MARRIOTT

• Extensively worked on Informatica Designer Components Source Analyzer, Warehouse Designer, Transformation Developer, Mapplet and Mapping Designer

• Designed and created complex source to target mappings using various transformations inclusive of but not limited to XML Transformations, Aggregator, Look Up, Joiner, Source Qualifier, Expression, and Router Transformations.

• Analysis and development of mappings using needed transformations using Informatica.

• Involved in testing various Informatica Mappings.

• Modified several of the existing mappings based on the user requirements and maintained existing mappings, sessions and workflows

• Monitored data warehouse weekly conversion to ensure successful completion.

• Used Korn shell scripts in automating various jobs and do the data validations.

• Involved in Creating Unit Test cases.

• Modified Function processes using Informix 4gl as front end.

• Developed SQL statements for querying and updating the databases.

• Analyzed and drilled down production issues to resolve them successfully.

Software Developer/Analyst Oct 2004 – April 2011

KROGER

• Modified several of the existing mappings based on the user requirements and maintained existing mappings, sessions and workflows.

• Monitored data warehouse daily/weekly jobs to ensure success.

• Automated the Informatica jobs using UNIX shell scripting.

• Prepared SQL Queries to validate the data in both source and target databases.

• Created System requirement specifications

• Created user entry forms using Informix 4gl as front end.

• Prepared user entry screens

• Analyzed and drilled down production issues to resolve them successfully.

• Assured Data Integrity after mass loading

• Involved in documentation and presentation.

• Involved in user training and support of the applications

• Designed and developed stored procedures in Informix

• Automate Informix and System process and procedures using shell scripts and Informix sql.

Software Consultant May 1999 – Oct 2004

Brandon Consulting Associates, Inc.

• Involved in System study, Analysis, user requirement and design of the company.

• Created System requirement specifications.

• Involved in Data Modeling - logical & physical design

• Identified the Entities and Attributes and built the Entity Relationship Diagrams using Erwin.

• Performed Database design in Informix Dynamic Server 9.3.

• Created user entry forms using Informix 4gl as front end.

• Developed SQL statements for querying and updating the databases.

• Prepared user entry screens

• Analyzed and drilled down production issues to resolve them successfully.

• Involved in documentation and presentation.

• Involved in user training and support of the applications.

• Designed and developed stored procedures in Informix

• Automate Informix and System process and procedures using shell scripts and Informix sql.

EDUCATION

• Bachelor of Science (Physics) from R.J. College, Mumbai (INDIA), 1992.

• Diploma in Computer Software Application from DataPro Computer Institute, Mumbai (INDIA), 1994.

CERTIFICATIONS

• Successfully Completed Web Academy conducted by Kroger/UC.

• Informatica 8.5 Completion Certificate - 2012

• ITIL Foundation Certified in 2013.

• AWS Certified Solutions Architect – Associate – April 2018.

• Booz Allen Hamilton - AI Enablement/Foundation Certification – April 2024

• Ontologize Certification – March 2025.

Completed Ontologize Training (Palantir Foundry & AIP Foundations).

• Databricks Data Engineer Associate Certification- April 2025. AWARDS

• For great dedication on the DoD Advana project - Champion's Heart – June 2020.

• Passionate Service - In Support of Advana - Nov 2020.

• Passionate Service - All star engineer- March 2025.



Contact this candidate