Sign in

Data Analyst

Long Island City, NY
September 12, 2018

Contact this candidate


Yi Shi

**** **** **. ***, **** Island City, NY 917-***-**** Education


Master of Science, Applied Analytics, GPA: 3.7/4.0 Aug. 2016 – Dec. 2017

Related courses: Machine Learning, Data Analytics Using SQL, Applied Analytics Frameworks and Methods WUHAN UNIVERSITY WUHAN, CHINA

Bachelor of Science with Honors, Mathematics and Applied Mathematics Sep. 2011 – Jun. 2015

President of WHU Student Union, Honorable Mention of Mathematical Contest in Modeling, Leading Director of Drama Club Skills

SQL PostgreSQL R Python Excel Tableau Salesforce JMP SPSS C SAS Venngage Infogram Project Experience

Neural Network and Deep Learning based Handwriting Recognition Apr. 2017

Built a 10-class classifier using Artificial Neural Networks to recognize handwritten images

Minimized the size of training data by randomly generating 2800 new training data

Structured H2O library model with 3 hidden layers in R, obtaining 94% testing accuracy on the trained network Spam Email Classification Feb. 2017

Compared the performances of traditional supervised learning methods, such as Logistic Regression, Linear Discriminant Analysis, K-Nearest Neighbors and Support Vector Machine to binary classification of spam emails

Ranked the algorithms based on accuracy, sensitivity, specificity, precision and running time, concluded Logistic is the best SQL Data Analytics – Grocery Purchase Data in US Jul. 2017

Tackle complex business problems by converting raw data from relational databases into actionable business strategies

Developed customer survival analysis to analyze active subscribers and stop rate to improve customer acquisition and retention

Limited the wealthiest zip codes based on household income, and analyzed incomplete orders to make business decision Predictive Modeling for Movie Recommendation Dec. 2016

Designing maximum likelihood algorithm to handle missing data to pre-processed IMDb movie dataset

Applied Principal Component Analysis to reduce data dimension and to determine predictors in JMP

Concluded Random Forest explains more variability than that of Multiple Linear regression based on R-Square to forecast the popularity scores of movies; Assessed model performances using cross validation and achieved 96% classification accuracy Work Experience


Data Analyst / Salesforce Developer Intern Apr. 2018 – Present

Data Analytics: Take charge of data cleaning, retrieving and reporting on internal CRM system using SQL and Excel, recommended best approaches for its consolidation to enhance data quality, then assisted with customized Salesforce development

Marketing Analytics: Developed Pardot platform to drive greater marketing automation by increasing lead generation and ROI, processed marketing data analytics of clients from Yelp Business on Ad Impressions and Click Rate JOBROBIN NEW YORK, NY

Analyst of Columbia Capstone Jun. 2017 – Aug. 2017

Data Visualization: Created interactive hierarchical bubble graphs in Tableau of data science job descriptions for job seekers, resume visualization for recruiting managers, specifically visualized the percentage and mastery of skills learned from school and work experience

Data Analytics: Implemented sampling on competitors, such as LinkedIn, Indeed and, to analyze key word frequency of hard and soft skills of data scientist, data engineer and data analyst MASSMUTUAL ASIA HONG KONG, CHINA

Business Analyst (Project Marketing Leader) Aug. 2012

Case Consulting: Worked on business case analysis for Premier Choice Flexi Fund Program to double sales in China

Marketing Analytics: Led project team to design marketing plans by PEST and SWOT analysis, developed 4P’s marketing strategies on building brand, customer acquisition and retention

Contact this candidate