DEEPTI PATIDAR
**************@*****.*** 945-***-****
SUMMARY
Purpose driven engineer specializing in ETL development and data analytics. I am motivated to apply hands-on data engineering expertise, with consistent high-quality work in addressing data-centric challenges and storytelling with a strong work ethic. SKILLS
Programming: Python (Pandas, PySpark), SQL, Shell Scripting, JavaScript, Prompt Engineering Tools: Databricks, AWS (S3, DMS, RDS), Apache Spark, Informatica, Oracle, Tableau, MySQL, Hadoop, Vertica, Excel, Jenkins, Gitlab Business Intelligence & Analytics: ETL/ELT, Reporting, CI/CD, Data Warehousing, Data Analysis, Data Visualization, KPI Tracking Data Management: Data Profiling, Data Modelling, Data Integration, Data Governance, Data Migration Project Management: Jira, Confluence, Agile, Scrum, Waterfall, SDLC, SharePoint, MS Office Suite EXPERIENCE
Data Engineering Intern – Vendors 2 Retailers July 2025 - Present
• Preprocessed and profiled a 10+ GB surgical dataset to ensure high-quality structured data for robust AI model training.
• Created and validated AI models in Python to accurately predict medication effectiveness and achieve data solutions and validation accuracy in predicting outcomes.
• Integrated and transformed diverse patients’ data into a single, comprehensive dataset, by creating data pipelines leading to 98% accurate predictions, pipeline building and scalable analysis.
• Identified the most important data points using SQL queries in MySQL for predictions using data and predictive analysis techniques, making models more efficient.
ETL Developer – Onit Inc. Jan 2020 - Mar 2022
• Automated workforce data integration pipeline feeds by developing and refining data models on MySQL, resulting in improved functionality utilizing informatica and SQL queries.
• Streamlined ETL processes using Tableau and Informatica to design automated cloud workflow, cutting workload hours by 40%.
• Led the development of the Engagement data transformations and Dashboard, facilitating detailed analysis of user activity and comprehensive engagement metrics, which contributed to a 15% increase in product pillar adoption.
• Efficiently handled numerous data issues utilizing Databricks, and data transformation and mining techniques for accrual reversal data, consistently orchestrating jobs utilizing Jenkins.
• Implemented a robust data strategy for real-time price changes and currency conversions and designed a detailed dashboard for vendors showcasing key metrics, including a 95% increase in ARR growth post-conversion and an additional seat count increase of 0.09%.
• Functioned as a techno-functional resource, conducted business analysis, provided design, and provided data-driven solutions to manage Repayment Schedule for customers, thereby reducing errors by up to 8.89%. ETL & Database Developer – NetLink Softwares Jul 2018 - Dec 2019
• Developed and optimized a data architecture for 5-Year Sales Plan application with detailed sales and volume breakdown logic.
• Implemented ETL pipelines using Python and Informatica to process over 20 million OEM records, managed data migrations and data transformation, and improved efficiency in data loading into MySQL and Vertica based large-scale data warehouses.
• Designed cloud workflows, business rules, and action items for historical trends to enhance forecasting accuracy by 70% and cut data processing time by 10%.
• Integrated multiple data sources including Salesforce and other cloud platforms and ensured smooth data profiling across staging, dimension, fact, and aggregate tables of the Snowflake and constellation schema.
• Performed root cause analysis, ensuring 100% data accuracy and resolving discrepancies in Original Equipment Manufacturer group details.
• Conducted data validation, logging, monitoring and error-handling systems for creation of Tableau, Excel reports and dashboards.
Data Analyst – HotWax Systems Aug 2017 - Jun 2018
• Optimized data collection and enhanced automated ETL data pipelines (KPI tracking, business rules, etc.) on Informatica and Oracle, executed different external requests via vendors, thereby minimizing the dissatisfaction rate by 9.7%.
• Collaborated with internal & external partners to facilitate data collection from multiple sources including MySQL, data integration, reporting, and analysis of data by crafting dashboards using Tableau, decreasing price incurred by at least 7.53%.
• Successfully consolidated a one-stop shop for 20+ KPIs into a single dashboard, streamlining accessibility for stakeholders.
• Standardized, scheduled, and monitored ETL processes using Tableau and MySQL to develop automated data pipelines, thereby reducing man-hours by 50%.
• Mentored the front-end development team on digital integration with the CRM tool, data quality assurance, data security and cross-collaborated with the stakeholders on evolving issues related to business metrics and financial operations. FREELANCE PROJECT
Developer – Freelance Support Sept 2022 - Jul 2023
• Performed comprehensive data preprocessing and Extract, Transform, Load (ETL) operations on a large-scale healthcare dataset that included handling missing values through data imputation (mean or interpolation) and outlier detection and removal.
• Conducted data aggregation utilizing Tableau, AWS (RDS, S3, DMS) and Databricks to convert granular, hourly time-series data into daily metrics, segmented by patient severity and location, to prepare for predictive modeling.
• Drove SDLC best practices in Agile/DevOps environments, managing LOE estimates, code reviews, job dependencies, and production support for reliable, maintainable data operations.
• Assessed machine learning model performance using common metrics: Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Root Mean Square Error (RMSE). This assessment helps with ongoing model improvement and selection.
• Showed that SARIMA's strength is its ability to break down time series into trend, seasonality, and residuals. This breakdown was essential for data modeling on MySQL for patient arrival patterns that show cyclical and non-linear behavior. EDUCATION
Master’s in engineering management
University of North Carolina • Charlotte, NC • 2023 - 2024 Master’s in computer science
Devi Ahiliya Vishwa Vidhyalaya • Indore, India • 2016 - 2018 Bachelor’s in computer science
Rajiv Gandhi Proudyogiki Vishwavidyalaya • Indore, India • 2012 - 2016 CERTIFICATIONS
• Microsoft MTA: Advanced Database
• Advanced SQL – Udemy
• Consulting Management – LinkedIn Learning
• Client Communication – LinkedIn Learning
• Linux Foundation Certification – IIT Bombay
• Workshop: Machine Learning – DAVV Indore
• Prototype Competition: IOT based forest fire prevention – Axelta Systems