Shubham Kadu
Data Science enthusiast with skills in statistics, as well as practical experience in data analytics, data cleaning, data visualization, machine learning, problem-solving, and creative thinking. Proficient in handling large datasets and prescriptive analysis, as well as scripting languages including Python and SQL. With a team-oriented attitude, I am eager to contribute my abilities to enhance my knowledge in the field of Data Science. adysyi@r.postjobfree.com 901******* linkedin.com/in/shubham-kadu16 github.com/shubhsk98 TRAINING EXPERIENCE
Data Science Trainee
AlmaBetter
06/2022 - Present, Bengaluru, Karnataka
Acquired proficiency in Python, SQL, Machine Learning, Tableau, and Power BI. With a strong foundation in programming and a deep understanding of data analysis and visualization techniques.
Acquired skills like Data Cleaning, Processing, Analysis, Visualization, Machine Learning & NLP. Hands-on experience on projects such as Regression, Classification, and Unsupervised.
Secured a place in the top 5% of students in the cohort of 450 students, and got selected for the prestigious Star Student Program. PROJECTS
Netflix Movies and TV Shows Clustering
AlmaBetter Verified Project
11/2022 - 12/2022,
The recommendation system of Netflix shows you movies and tv-show according to your interest. This model helps Netflix recommend personalized content to its users. Performed EDA, Data Cleaning, and Data Pre-Processing like removing stop-words and punctuations and NLP by applying TF-IDF & count Vectorizer Built a K-Means clustering model for clustering the same type of content. Used the Elbow method and Silhouette score to find the number of clusters. Analyzed the sentiments of the Netflix Movies and TV Shows Clustering and understanding recent year trends of Netflix content in different countries. Credit Card Default Prediction
AlmaBetter Verified Project
10/2022 - 11/2022,
One of the biggest threats facing banks now is credit card defaults. Can we reliably predict who is likely to default? If so, the bank can avoid losses by offering alternative options to customers.
Carried out feature selection and understand the impact of features. After Implementing SMOTE to handle the imbalance dataset. Developed a binary classification model using algorithms such as Logistic Regression, SVM, and Decision Tree to predict whether a customer will default on credit card payments. Evaluated model using confusion matrix, precision, recall, and AUC-ROC.
From the model, the Random Forest performed well with a Recall value is 90.46% Yes Bank Stock Closing Price Prediction
AlmaBetter Verified Project
09/2022 - 10/2022,
The stock price of YES bank actually fell from 2018 onward. Owing to this fact, it's interesting to see how that impacted the stock prices of the company and whether predictive models can do justice to such situations. Performed a thorough examination of the data through exploratory analysis, Data Preprocessing, and also handled outlier detection, and missing value. Built a regression model using Linear, Lasso, Ridge, and Elastic Net models to predict the closing price of Yes Bank for the next month. evaluated matric using R2, and Adjusted-R2.
In spite of all models, Ridge Regression performed best with R2 scores of 95% TECH STACK
Expertise in Languages & Tools (x/5)
Python-4.7 SQL-4.5 Excel-4.1 Power BI-4.3 Looker Studio-4.0 Tableau-4.3 GitHub-4.0
ML Algorithms
Linear Regression, Logistic Regression, Decision Tree, Random Forest, SVM, KNN, Neural Network, XG Boost, K-means Clustering, NLP, PCA, Recommender system.
Platforms
Jupyter Notebook, PostgreSQL, Google Colab, Google Data Studio, Tableau, Power BI, VS Code.
Libraries and Frameworks
Scikit-Learn, Pandas, NumPy, Seaborn, Matplotlib,
NLTK, TensorFlow, Keras, Computer Vision, Flask
ACHIEVEMENTS
Gold Badge in Python & SQL (01/2023)
HackerRank 455 Points in Python and 670 Points in SQL Python (Basic) Certificate (12/2022)
HackerRank
RELEVANT COURSEWORK
Full Stack Data Science (06/2022 - Present)
AlmaBetter
Python - A to Z Full Course for Beginners
(01/2022 - 03/2022)
Udemy
PUBLICATIONS
Medium Blogs
Machine Learning Use Cases in ‘E-commerce’
2023
EDUCATION
M.Sc in Statistics
Rashtrasant Tukadoji Maharaj Nagpur
University
CGPA: 7.76 2019 - 2021
B.Sc in (PCM) Group
Rashtrasant Tukadoji Maharaj Nagpur
University
Percentage: 56% 2016 - 2019
INTERESTS
Travelling Gym Defense News
Tags: Python, Excel, SQL, Tableau, Power BI, Machine Learning, Presentation Skills Tags: Unsupervised Learning, K-mean, Elbow method, Silhouette score, NLP, PCA, Clustering Tags: Logistic Regression, Decision Tree, Random Forest, XG-Boost, KNN, ROC-AUC Tags: Linear Regression, Lasso, Ridge, Elastic Net, EDA