Post Job Free
Sign in

Data Service

Location:
San Francisco, CA
Posted:
June 10, 2016

Contact this candidate

Resume:

Sakshi Bhargava

¬ +1-415-***-**** * *********@****.*****.*** sbhargava024

Education

Masters in Analytics July 2015-June 2016(expected) University of San Francisco, San Francisco, CA, GPA: 3.9/4.0 Courses: Machine Learning, Time Series Analysis, NOSQL Databases, Linear Regression, Relational Databases, Data Acquisition, Exploratory Data Analysis, Computation for Analytics, Probability and Statistics, Data Visualization, Distributed Computing, Supply Chain Analytics, Design of Experiments, Web Analytics

Bachelor of Electronics and Communication Engineering Sept 2008–June 2012 IES IPS Academy, India, GPA: 3.6/4.0

Technical Skills

Spark, Python, R, PostgreSQL, AWS, Tableau, D3, SAS, VBA, Teradata, Learning Hadoop and Hive

Machine Learning, Text Analysis, Time Series Analysis, Predictive Modeling, Data Mining, Image Analysis Professional Experience

Data Science Intern November 2015–Present

Williams-Sonoma Inc. (Retail, e-commerce) San Francisco

– Enhanced accuracy of image classification process from 90% to 99%

– Applied graph cut and mean shift algorithm to identify object in an image and predict its color with 91% accuracy

– Recognized patterns in an image using convolutional neural network with 82% accuracy

Senior Business Analyst October 2012–April 2015

Mu Sigma Business Solutions Pvt. Ltd. (Advanced Analytics and Decision Sciences) Bangalore, India Pattern Analysis of Regulatory Visits at Pharmacy Stores

– Predicted likelihood of inspections of stores using logistic regression model. Performed pre-post analysis, hypothesis testing and k-means clustering

– The model had 75% accuracy and had led to 10% reduction in fines due to non-compliance Sales Driver Analysis and Sales Forecasting for Spare Parts of Home Appliances

– Developed regression model to determine the various drivers a ecting orders of spare parts for distribution channels

– Built forecast model using ARIMA to predict orders of spare parts. Automated process using SAS and VBA Import Inventory Management for Retail Stores

– Forecasted import demand using decomposition time series modeling techniques (SAS).

– Prediction methodology improved demand forecast accuracy by 18% Compliance Performance Scorecard

– Developed scorecard in Tableau by defining metrics and their thresholds to assign compliance scores to pharmacies Awards. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Awarded for ’Pattern Analysis of Regulatory Visits’ project for Leading the project Providing thorough and rigorously tested analytical solutions Hard work and dedication Course Projects

Scalable Yelp Recommendation Service: Built a web app to recommend businesses to Yelp users. Used Spark and applied Matrix Factorization, Random Forest, Gradient Boosting models and feature engineering to predict business ratings. Improved RMS error by 8% over the baseline model of average ratings

Kaggle’s "What’s Cooking" Competition by Yummly: Built multiclass classifier using python, to predict cuisine of a recipe, given a list of ingredients. Built an ensemble of logistic regression, random forest and gradient boosting models and achieved accuracy of 80.99%. Finished amongst top 10% (approx 1400 teams)

REST Web Service: Amazon Beauty Products Data API: Created a RESTful web service (hosted on EC2) using Python Flask to retrieve data, stored in AWS RDS. Performed ETL on json data using python and PostgreSQL

Retweet Analysis: Analyzed factors that lead to retweeting and developed a regression model to predict the number of retweets using Python and PostgreSQL, pulling data from the Twitter REST API

Sentiment Classifier: Classified movie reviews as positive or negative with 82% accuracy, by implementing self-coded Naïve Bayes algorithm in both python and pyspark. Enhanced accuracy up to 90% by adding a subjectivity classifier



Contact this candidate