Post Job Free

Resume

Sign in

Science Intern Data Scientist

Location:
Lansing, MI, 48933
Salary:
30 USD per hr
Posted:
March 22, 2023

Contact this candidate

Resume:

SAUMYA SHAH

DATA SCIENTIST

Searching for Summer and Fall 2023 Internship roles!

adv2ss@r.postjobfree.com • 517-***-**** • linkedin.com/in/5aumya-5hah/ • github.com/snshahgit • Michigan, USA Consistent • Promising learning curve • Higher stress thresholds • Exhibits professional integrity • Public Speaker • Humble • Grateful • Realist Education

Master of Science (MS) in Data Science September 2022 – May 2024 Michigan State University, MI, USA

Relevant Coursework: Applied statistical modelling, Big Data Analytics, Data Mining, Computational Optimization Bachelor of Technology (BTech) in Computer Engineering July 2018 – July 2022 Pandit Deendayal Energy University, Gujarat, India Relevant Coursework: Data Structures and Algorithms, Linear Algebra (Mathematics III), Data Visualization, Web Development Experience

Graduate Research Assistant – III September 2022 – Present Imaging and Deep Learning Lab, Michigan State University, East Lansing, MI

• Collaborated in research on deep learning models for disease detection and structural analysis from human biopsy extracts.

• Migrated the simulated data generation scripts from MATLAB to Python.

• Python packages such as – Scipy, Stats models, Numpy, Pandas, and TensorFlow, were used to build a CNN-based autoencoder architecture to process

(denoise) the simulated spectral image data.

• Baseline modelling results indicate an F1-score of 87%. Machine Learning Intern January 2022 – August 2022 Seaflux Technologies Pvt. Ltd., Ahmedabad, Gujarat, India

• Constructed a product recommendation system for an E-commerce application using an ensemble of 3 base algorithms — collaborative filtering, content- based filtering (NLP), and associate rule matching (market-basket analysis).

• Assisted the ML team in achieving a scale of 5000+ products, 80+ categories, and an average monthly revenue of $10000 (800K INR).

• Automated a ML pipeline to showcase a supply chain dashboard (daily orders forecast) to the merchants on the app. It consisted of a time-series model

(SARIMAX) deployed in conjunction with AWS Step, Lambda function triggers. Data Science Intern July 2021 – December 2021

Kyra Solutions Inc., Tallahassee, FL

• Programmed computer vision scripts to capture geo-fenced video footage, and snapshots (YOLOv5) of traffic violation incidents. This enabled tracking of vehicles (DeepSORT algorithm) as well as optical character recognition (OCR) of the number plates with a true positive rate of 78%.

• Deployed the computer vision scripts for Florida Department of Transport (FDOT) using technologies such as micro-web services (FastAPI) and MLOps

(Docker, AWS Canary deployment). Results of A/B testing for Tallahassee, FL demonstrated an increase of 120-200 traffic violation incidents per day. Projects

• Telecom Customer Churn Modelling February 2023

o This project diagnostically predicts whether a telecom company customer plans on continuing/subscribing the service for next term. o Customer big data used to train the ML model includes gender, marital status, different types of services, type of contract, streaming preferences and the charges incurred by the customer. Results can be used as inputs for dynamic pricing system to offer great deals to retain the customer.

• AI-based Credit Approval System December 2022

o This project helps in approval of credit application of a bank client. Client data used to train ML model includes employment information, previous defaults, industry of work, age, credit score, debt, and income. o This model confirms that the decision is not made from sexism, racism, or other forms of discrimination including locality (zip code), ethnicity, and last name (religion).

Skills

Coding Languages: Python JavaScript R Language Julia C++ SQL MATLAB PHP Technologies: Machine Learning Deep Learning (Tensorflow) AWS Step, Lambda Web scraping git PySpark Docker Power BI Laravel Certifications: IBM Data Science NPTEL Deep Learning 100 days of ML code challenge Interests: Social Networks Recommendation Systems Pricing models Pattern Recognition Derivative Markets Co-curriculars: Proficient at googling Flexible with Linux Learning Japanese Amateur at graphic designing (Adobe illustrator, InDesign) Volunteering: Teaching poor kids during summers Organizing blood donation drives/camps Saxophone and Harmonica player at college band



Contact this candidate