DHRUV CHAUBEY
** *-***-**** **********@*****.*** https://www.linkedin.com/in/dhruv-chaubey/ https://github.com/dhruvchaubey Summary
Data Scientist with mid-level experience in data engineering and data science projects. Successfully improved user engagement by integrating machine learning algorithms to personalize features and developed comprehensive reports to aid decision-making. Previous achievements include increasing data processing efficiency and ensuring data integrity for IBM clients. Looks to leverage this background in developing impactful data products that meet client needs and educate stakeholders in data-driven decision-making.
Education
New Jersey Institute of Technology Sep 2022 - May 2024 Master of Science, Data Science
New Jersey Institute of Technology Aug 2013 - Jun 2017 Bachelors degree
Work Experience
NeurotechR3 Feb 2024 - May 2024
ML Engineer-Capstone Project
Improved user experience and learning outcomes by 15% through deploying a recommendation system using machine learning algorithms, Python, and cloud technologies such as AWS, to analyze user performance data and tailor game settings.
Enhanced system performance by seamlessly integrating machine learning algorithms into AWS using S3 and SageMaker, leveraging data insights and improving user engagement metrics.
Increased actionable decision-making insights by 30% through the creation of comprehensive reports and data visualizations, utilizing statistical techniques to analyze user performance patterns. New Jersey Institute of Technology Sep 2023 - May 2024 Teaching Assistant
Provided academic mentorship and guidance to over 150 students through the integration of statistical techniques and data-driven approaches in physics labs, coursework evaluation, and educational support. IBM Apr 2018 - May 2022
Software Engineer Analyst
Achieved a 10.5% increase in conversion rates, quantified by customer engagement, by designing data models to analyse IBM customers’ behaviours.
Accomplished a 15% improvement in data processing efficiency, calculated by processing speed, by optimising the client’s application tracking system (ATS) using SQL queries and Python scripts.
Ensured 99% data integrity, gauged by successful validation checks, by executing an ETL workflow using Azure DataBricks for data cleansing, transformation, and validation.
Managed the implementation of an e-commerce platform, achieving 95% operational efficiency, by integrating payment systems, linking IBM to third-party services via APIs, and developing a custom ERP for inventory, sales, and reporting for a banking client.
Performed the deployment of 6 application modules, checked by their successful operation in a live environment, using IBM Cloud’s IaaS for a client’s application.
Projects
Stock Price Trading at the close Oct 2023 - Dec 2023
Tailored a machine learning model to predict closing price movements for Nasdaq-listed stocks, leveraging order book data and closing auction dynamics to identify trading opportunities.
A statistical model adeptly handles extensive datasets, utilising order book and closing auction data for scalability and optimal performance, achieving a 5% increase in accuracy as measured by the MAE metric. Red Wine Quality- EDA and Classification Mar 2023 - Mar 2023
Formulated a classification model to evaluate red wine quality based on a comprehensive dataset containing 12 distinct features.
Applied Python’s robust Machine Learning Algorithm to fastidiously prepare and train the predictive model, achieving an exceptional accuracy rate of 94%, signifying its capability to predict wine quality accurately. COVID-19 Vaccination Program Dec 2022 - Dec 2022
Developed and operated real-time monitoring and tracking Tableau dashboard for key performance indicators(KPIs), monitoring COVID-19 vaccination progress across 100+ countries. Skills
Programming Languages: Python, R, SQL, Scala, C#, Snowflake
Machine Learning/Statistics: Regression, Classification, Neural Networks, Statistical Analysis, Regularizations, Statsmodels, Machine Learning Algorithms, Statistical Techniques
Framework: Django, TensorFlow, Keras, Hadoop, BeautifulSoup, Spark, PySpark, XGBoost
Tools: AWS, Tableau, IBM Watson Studio, GIT, PowerBI, Terraflow
Key Skills: Data Visualization, Predictive Analysis, Statistical Modeling, Clustering & Classification, Data Analytics, Big Data Analytics, Data Modeling, CI/CD, Web Scrapping, Algorithms, Schema design, Applied Mathematics, Artificial Intelligence Concepts
Cloud Technologies: Cloud Technologies