Shubham G. Kanshetti
Tel: +91-976*******
Email: ***********@*****.***
LinkedIn: www.linkedin.com/in/sshubham913
Objective
Seeking a Data Science role to build on my hands-on experience in the field. My goal is to make a difference to the organization I join and to contribute to its growth and success.
Education
• B.E in Computer D.Y.Patil COE, Pune June 2018
• Post Graduate Program in Data Science International School of Engineering (INSOFE) November 2019
Certification
• Post Graduate Program in Data Science International School of Engineering (INSOFE) Certified by the Language Technologies Institute (LTI) in the School of Computer Science at Carnegie Mellon University, USA
• Predictive Business Analytics Edupristine
• Machine Learning A-Z: Hands-On Python & R In Data Science Udemy certification
• Introduction to Oracle 11g Seed Info-tech
Work Experience
Data Scientist with Blocklogy, Navi Mumbai February 2020 - Present
Major project-
1. Recommendation engine for Platforms
• Built a collaborative-filtering-based recommendation model for app and blockchain platforms.
• Analyzed platform data to help improve product efficiency through AI.
• Created content on machine learning, statistics, and artificial intelligence for the e-learning platform Era Swap Academy.
• Worked with the database and analyzed the data.
Intern Data Scientist with Singularity AI Labs, Gurugram December 2019 - January 2020
Major project-
1. Revenue optimization for Hospitality Domain
• Built data quality analysis and property performance report scripts.
• Wrote anomaly analysis scripts for competitors' property data.
• Detected wrongly mapped properties through channel-wise detection.
• Worked with the database and analyzed the data by occupancies and public prices.
• Used Tableau dashboards to analyze price movement.
Intern Data Scientist with Climate Connect Technologies, Pune February 2019 - May 2019
Major projects-
1. Predictive Model for Grid Load Forecasting
• Predicted short-term unrestricted energy demand on a day-ahead and intra-day basis for a major utility in a Tier 1 city.
• Liaised with multiple stakeholders on domain data to identify the features important for modeling.
• Identified the key characteristics of the relationship between weather and load, and focused on predicting from past load trends along with numerous other parameters.
2. Predictive Model for Solar Power generation Forecasting
• Predicted day-ahead solar power generation for several solar farms for a major energy firm.
• Overcame major challenges such as sub-par solar forecast accuracies.
• Used an ensemble model that achieved the best accuracy, exceeding client expectations.
Achievement
• Commended for deploying the solar power generation forecasting model.
Hackathon
1) Predicting the alpha signal using microblogging data (Project Hackathon Defense, INSOFE)–
• Performed data visualization & EDA on the data gathered.
• Applied feature engineering, removed unwanted columns, and cleaned the text data.
• Converted emoticons, contractions, and hashtags into text as part of feature engineering.
• Built a machine learning model to predict alpha from stock factors and a sentiment analysis model on the parsed data.
• Used the sentiment scores as a factor, along with stock factors, to predict alpha.
• Applied logistic regression, random forest, decision trees, XGBoost, MLP, embeddings, and TF-IDF.
• The evaluation metric was the macro F1 score.
• Python was the programming language used.
2) Predicted the Flat Resale Prices in Singapore (Mid-Term Hackathon, INSOFE)–
• Performed data visualization & EDA on the data gathered.
• Applied feature engineering, dummification, removal of unwanted columns, and data cleaning.
• Applied linear regression, random forest, decision trees, and XGBoost.
• Ensemble techniques such as stacking gave the best accuracy.
• Python was the programming language used.
3) Capstone Project (Edupristine): Building a model for FOREX prediction.
Description: Build a model to predict the USD to INR conversion rate within a +/-0.25 INR accuracy level. The data contains a number of missing values daily, monthly, quarterly, and yearly.
Implementation:
• Data preprocessing such as missing value imputation, outlier treatment, and feature engineering.
• Algorithms: multiple linear regression, or time series analysis such as the Triple Exponential Smoothing model or the ARIMA model.
• Final model description, validation, and model evaluation with RMSE, MAPE, and accuracy.
• Data visualization in RStudio and with Excel charts and graphs.
Achievement
• Awarded Merit Excellence in Edupristine.
Self-Driven Data Science Projects
1) Built classification models for:
• Prediction of income level (Adults Analysis)
• Prediction of whether a prescribed medication requires pre-authorization.
• Applied Logistic Regression, Naïve Bayes, SVM, KNN, Decision Trees, Random Forest
2) Built regression models for:
• Prediction of Sales on Black Friday (Analytics Vidhya).
• Prediction of taxi fares.
• Applied Multiple Linear Regression, Regularization, Boosting Techniques
3) Built Natural Language Processing models for:
• Twitter Sentiment Analysis (Analytics Vidhya).
• Web scraping and analysis of Indian bloggers.org
• Applied spaCy, NLTK, TextBlob.
4) Built neural network models for:
• Analysis of the MNIST dataset and women's clothing reviews.
• Image classification on the Dog, Cat, and Horses & Humans datasets.
• Applied GloVe and word embeddings with Keras.
• Applied MLP, CNN, and autoencoders.
Technical Skills
Languages: Python, R, SQL, NLP, Spark
Machine Learning algorithms:
Supervised algorithms: Linear regression, Logistic regression, KNN, Decision trees, Random forest, SVM, Boosting Techniques.
Unsupervised algorithms: Association rule mining, Collaborative filtering, K-Means.
Neural network algorithms: MLP, RNN (LSTM), CNN.
Tools: Anaconda, Postman, PuTTY, Hadoop, Linux, PowerPoint, Tableau, Power BI, Microsoft Excel, Oracle 11g
Achievement
• Received the Tuition Fee Waiver Scheme (TFWS) scholarship from the Government of Maharashtra during B.E.
• Secured First Class with Distinction in Bachelor of Engineering.
• Secured Certificate of Excellence in Predictive Business Analytics from Edupristine.