Post Job Free

Resume

Sign in

python, sql, Machine Learning Data Science

Location:
Hinganghat, Maharashtra, India
Posted:
August 08, 2023

Contact this candidate

Resume:

Shubham Kadu

Data Science enthusiast with skills in statistics, as well as practical experience in data analytics, data cleaning, data visualization, machine learning, problem-solving, and creative thinking. Proficient in handling large datasets and prescriptive analysis, as well as scripting languages including Python and SQL. With a team-oriented attitude, I am eager to contribute my abilities to enhance my knowledge in the field of Data Science. adysyi@r.postjobfree.com 901******* linkedin.com/in/shubham-kadu16 github.com/shubhsk98 TRAINING EXPERIENCE

Data Science Trainee

AlmaBetter

06/2022 - Present, Bengaluru, Karnataka

Acquired proficiency in Python, SQL, Machine Learning, Tableau, and Power BI. With a strong foundation in programming and a deep understanding of data analysis and visualization techniques.

Acquired skills like Data Cleaning, Processing, Analysis, Visualization, Machine Learning & NLP. Hands-on experience on projects such as Regression, Classification, and Unsupervised.

Secured a place in the top 5% of students in the cohort of 450 students, and got selected for the prestigious Star Student Program. PROJECTS

Netflix Movies and TV Shows Clustering

AlmaBetter Verified Project

11/2022 - 12/2022,

The recommendation system of Netflix shows you movies and tv-show according to your interest. This model helps Netflix recommend personalized content to its users. Performed EDA, Data Cleaning, and Data Pre-Processing like removing stop-words and punctuations and NLP by applying TF-IDF & count Vectorizer Built a K-Means clustering model for clustering the same type of content. Used the Elbow method and Silhouette score to find the number of clusters. Analyzed the sentiments of the Netflix Movies and TV Shows Clustering and understanding recent year trends of Netflix content in different countries. Credit Card Default Prediction

AlmaBetter Verified Project

10/2022 - 11/2022,

One of the biggest threats facing banks now is credit card defaults. Can we reliably predict who is likely to default? If so, the bank can avoid losses by offering alternative options to customers.

Carried out feature selection and understand the impact of features. After Implementing SMOTE to handle the imbalance dataset. Developed a binary classification model using algorithms such as Logistic Regression, SVM, and Decision Tree to predict whether a customer will default on credit card payments. Evaluated model using confusion matrix, precision, recall, and AUC-ROC.

From the model, the Random Forest performed well with a Recall value is 90.46% Yes Bank Stock Closing Price Prediction

AlmaBetter Verified Project

09/2022 - 10/2022,

The stock price of YES bank actually fell from 2018 onward. Owing to this fact, it's interesting to see how that impacted the stock prices of the company and whether predictive models can do justice to such situations. Performed a thorough examination of the data through exploratory analysis, Data Preprocessing, and also handled outlier detection, and missing value. Built a regression model using Linear, Lasso, Ridge, and Elastic Net models to predict the closing price of Yes Bank for the next month. evaluated matric using R2, and Adjusted-R2.

In spite of all models, Ridge Regression performed best with R2 scores of 95% TECH STACK

Expertise in Languages & Tools (x/5)

Python-4.7 SQL-4.5 Excel-4.1 Power BI-4.3 Looker Studio-4.0 Tableau-4.3 GitHub-4.0

ML Algorithms

Linear Regression, Logistic Regression, Decision Tree, Random Forest, SVM, KNN, Neural Network, XG Boost, K-means Clustering, NLP, PCA, Recommender system.

Platforms

Jupyter Notebook, PostgreSQL, Google Colab, Google Data Studio, Tableau, Power BI, VS Code.

Libraries and Frameworks

Scikit-Learn, Pandas, NumPy, Seaborn, Matplotlib,

NLTK, TensorFlow, Keras, Computer Vision, Flask

ACHIEVEMENTS

Gold Badge in Python & SQL (01/2023)

HackerRank 455 Points in Python and 670 Points in SQL Python (Basic) Certificate (12/2022)

HackerRank

RELEVANT COURSEWORK

Full Stack Data Science (06/2022 - Present)

AlmaBetter

Python - A to Z Full Course for Beginners

(01/2022 - 03/2022)

Udemy

PUBLICATIONS

Medium Blogs

Machine Learning Use Cases in ‘E-commerce’

2023

EDUCATION

M.Sc in Statistics

Rashtrasant Tukadoji Maharaj Nagpur

University

CGPA: 7.76 2019 - 2021

B.Sc in (PCM) Group

Rashtrasant Tukadoji Maharaj Nagpur

University

Percentage: 56% 2016 - 2019

INTERESTS

Travelling Gym Defense News

Tags: Python, Excel, SQL, Tableau, Power BI, Machine Learning, Presentation Skills Tags: Unsupervised Learning, K-mean, Elbow method, Silhouette score, NLP, PCA, Clustering Tags: Logistic Regression, Decision Tree, Random Forest, XG-Boost, KNN, ROC-AUC Tags: Linear Regression, Lasso, Ridge, Elastic Net, EDA



Contact this candidate