Post Job Free
Sign in

Data Analyst/ Data Scientist

Location:
Brooklyn, NY
Salary:
above 70,000/ annual
Posted:
January 04, 2021

Contact this candidate

Resume:

Yingyuan (Valerie) Zhang

+1-608-***-**** Brooklyn, NY, 11201 **********@*****.***

Education

New York University (Master of Science, pending December 2020) 09/2019- Present Major: Applied Data Science GPA: 3.75/4.00

Courses: Database, Simulation Model, Optimization, Machine Learning, Deep Learning, Big Data University of Wisconsin- Madison (Bachelor of Science) 09/2017- 05/2019 Major: Economics with Mathematical Emphasis GPA: 3.50/4.00 Missouri State University (Associate of Arts) 09/2015- 06/2017 Major: International Business GPA :3.78/4.00

Skills

Programming: Python (Pandas, Matplotlib), SQL (PostgreSQL, SQLite), Big Data (Hadoop, SparkSQL, PySpark, MapReduce), R, UNIX, Git

Data Analysis: Machine Learning (Scikit-learn, SciPy), A/B Testing, Predictive Analysis, Linear Regression, Clustering, Recommendation Systems, Time Series Analysis, Data Visualization Software: Tableau, AWS (S3, EC2), STATA, Advanced Excel, Salesforce, Google Analytics Project

Capstone: Analyzing Covid-19 Impact on Urban Mobility Change of NYC Nighttime Industry Data Scientist 01/2020- 07/2020

- Built real- time mobility data ETL pipeline and created the baseline model from 2019 to 2020 for comparing nightlife venue volume change before and after COVID-19

- Trained Time Series (ARIMA) model on transportation system to understand feature importance and to predict job loss over 84% of nightlife industry during COVID-19

- Built Multivariate Linear Regression models to analyze correlations among nightlife industry growth, mobility pattern changes and COVID-19 confirmed cases Instacart User Segment and Market Basket Analysis

Data Scientist 03/2020- 06/2020

- Conducted market basket analysis by applying PCA over 40 features to reduce dimensions and using K-means Clustering to generate shopping behaviors and built user segmentation based on time intervals

- Applied Random Forest multiclass classification to identify customers into three classes and built collaborative recommendation system to predict customer’s shopping basket of next purchase Yelp APP Rating and Reviews Analysis

Data Analyst 10/2019- 12/2019

- Extracted, cleaned and manipulated raw data as well as visualized over 40K restaurants data

- Trained Random Forest and classification model to analyze key business metrics of restaurants, accuracy of test samples is 0.76

- Performed LSTM sentiment analysis and word clouds of user reviews and to provide business insight and help restaurants to improve rating on Yelp

Work Experience

Data Analyst Internship, Philips, Co., Ltd, China 12/2017- 03/2018

- Introduced KPI dashboard and analyzed discount sales with yearly sellout data via SQL, Excel and Tableau, helped improve KPI calculation efficiency by 25%

- Built data survey and standardized reports, increased customer response rate by 9% Financial Analyst Internship, Manulife, Shanghai City 05/2017- 08/2017

- Leveraged Time Series model VAR and ARIMA to predict insurance sales trends

- Ensured correct data interpretation and worked with internal teams in managing financial data and anomaly detection of 1000+ clients, increasing insurance recommendation accuracy by 17% Activities

Writer, Toward Data Science (https://medium.com/@yz6378) 09/2020- Present Teaching Assistant of Urban Economics, New York University 01/2020- 06/2020 Volunteer, provide supports for a child and his family in Lesotho, Africa 09/2015- Present



Contact this candidate