Post Job Free
Sign in

Data Engineer

Location:
Jersey City, NJ
Posted:
April 29, 2025

Contact this candidate

Resume:

Mansha Manchanda

***************@*****.*** +1-718-***-****

EDUCATION

Columbia University New York, NY

Master of Science in Applied Analytics, GPA: 3.8/4.0 Jan 2022 - May 2023

● Relevant Coursework: Managing Data, Storytelling with Data, Machine Learning, Natural Language Processing SRM Institute of Science and Technology Chennai, IN Bachelor of Technology in Computer Science and Engineering May 2019

● Relevant Coursework: Data Structures & Algorithm, Algorithms, Database Management, Analysis of Algorithm SKILLS

● Programming Languages: Python, Java, Shell scripting, SQL

● Big Data Frameworks: Spark, Hadoop

● Cloud Technologies: Azure- Databricks, Data factory, Synapse, ADLS, Blob, Functions, Event Hubs, Cosmos DB AWS- Glue, Lambda, S3, Redshift, Athena, IOT, DynamoDB, Quick sight, Kinesis, EMR

● CI/CD & Tools: Tableau, Looker, Power BI, Airflow, Databricks, Snowflake Jenkins, SVN, Git,Terraform

● Databases: Oracle, Postgres, MySQL NoSQL, MongoDB, ElasticSearch, Opensearch EXPERIENCE

ADP New York Meropolitan Area

Data Engineer June 2023 - Present

● Spearheaded the Prevailing Wage data extraction project by leveraging cutting-edge Gen AI models (GPT-4, Llama, Claude v2) to compile comprehensive unstructured wage datasets, creating opportunities for future data monetization.

● Engineered a scalable data warehouse on redshift that integrated multiple data sources, reducing processing time by 15% and significantly enhancing system capacity to handle increasing data volumes efficiently.

● Led the development of large scale ETL pipelines for 6 products at ADP on the AWS, Kafka and Databricks platform, utilizing Delta Live Tables for data processinag, with deployment managed through Jenkins for streamlined operations.

● Achieved a $84,000 annual reduction in AWS storage costs by advanced compression techniques and identifying underutilized data tables, driving significant cost efficiencies.

● Collaborated closely with business stakeholders to gather requirements and implement Onedata data architecture standards, ensuring alignment with organizational goals and documenting best practices on Confluence for knowledge sharing.

● Implemented an API on AWS Lambda and REST API Gateway to retrieve data from OpenSearch, enhancing data accessibility and enabling seamless integration with other systems and applications, supporting scalable and efficient data retrieval processes. Columbia University New York, NY

Data Engineer Sep 2022 – May 2023

● Initiated and completed a comprehensive data analysis project that harnessed SQL querying on extensive student records, resulting in the creation of automatic visualization tools that improved academicresource management

● Spearheaded an AI based tool for developing a recruiterassistance software, using BERT and LLM algorithms, which facilitated the seamless matching of job posts with qualified student profiles

● Dashboard Creation: Designed and implemented interactive dashboards using Power BI and Tableau, enabling real-time data insights for stakeholders across multiple departments.

KBRA – Kroll Bond Rating Agency New York, NY

Data Engineer Intern Jan 2023 – April 2023

● Designed and implemented data validation pipelines for credit risk loans data to ensure data integrity. Validated and cleaned 1,000,000+ records per day as part of the data ingestion process.

● Reduced data-related errors by 25%, leading to higher data quality and trust in the data lake.

● Optimized & refactored an existing large code base. Significantly improved process efficiency, reducing execution time by 4 seconds. ZS Associates New Delhi, IN

Data Engineer, Business Technology Analyst Jul 2020 – Dec 2021

● Created real-time dashboards and reports using SQL and Tableau to provide sales teams with key performance indicators (KPIs) and customer data insights, ultimately enhancing quota development

● Utilized advanced tools and platforms such as Hadoop, Spark, and Databricks to manage and process large-scale datasets. Developed and optimized data pipelines, contributing to the scalability and efficiency of data workflows.

● Partnered with cross-functional teams to gather and translate requirements into practical data-driven solutions. Ensured alignment of data strategies with key product decisions and ROI analysis

● Developed predictive models using customer data to boost sales engagement by 24%. Additionally, identified key market segments through data analysis, leading to a 10% sales increase with the launch of a new product. Infosys Pune, IN

Software Engineer Aug 2019 – Jun 2020

● Built functionality to block credit and debit cards in a local or a foreign region using Java, and SQL. Devised functionality provides a sturdy solution for events where an individual card is stolen or lost or a malicious transaction is executed

● Formulated an e-pin generation functionality for credit, debit, and prepaid cards using JavaScript, HTML, CSS, and Node.js to expedite the e-pin generation process and enhance customer experience



Contact this candidate