Post Job Free
Sign in

Data Scientist with Python, SQL, AWS, Spark

Location:
Tempe, AZ
Posted:
April 06, 2020

Contact this candidate

Resume:

Yu-Wei (Jacky) Chung

+1-480-***-**** ********@***.*** Portfolio, LinkedIn, GitHub Tempe, Arizona EXPERIENCE

Data Science Intern November 2019-Present

MASTER ELECTRONICS (U.S. Market Leader in Electronic Components distributing), Phoenix, Arizona

• Mined and extracted order records using SQL and AWS and implemented customer segmentation via Python for developing dynamic demanding strategies; analyzed customer preference and built customer profiles by K-means clustering from 10M+ data

• Designed machine learning models (Logistic Regression, Random Forest) in Python to predict stockout ratio for inventory management and marketing strategy; improved forecasting accuracy by 10%

• Developed core business metrics to prototype analysis pipeline, and built six Tableau dashboards tracking user-retention metrics

• Leveraged knowledge: Python, TensorFlow, R, Tableau, MySQL, AWS Data Scientist February 2018-July 2018

DIRECTGO (Leading Online Shopping Agent in Taiwan), Taipei, Taiwan

• Integrated seasonality factors to existing data pipelines about the prediction of customer lifetime value and established a new advanced Time Series model in Python; achieved 5% sales growth

• Created A/B test plans and formulated optimal marketing strategies for a new vendor sales program; lifted conversion rate by 13%

• Expanded social media followers by 2x by monitoring the health of promoted ads, text analytics, topic modeling, time series analysis, and posts scraping using Python and R to locate most engaging topics and keywords for FB users

• Developed new KPIs to measure performances for two stakeholders, and built five Tableau dashboards to support marketing and product teams in real-time decision-making about production optimization and pricing

• Leveraged knowledge: Python, R, Tableau, MySQL, Excel, Google Analytics Data Analyst Intern June 2015-August 2015

HEXUN.COM (The Top Financial Online Media in China), Beijing, China

• Implemented Root-Cause Analysis (Bayesian Networks) for improving marketing performance by changing the App theme

• Saved 80% reporting time by defining user-engagement metrics (DAU, ARPU) and customer lifetime value using SQL for website and App, and by developing auto dashboards to track important metrics and KPIs

• Leveraged knowledge: Excel, MySQL

PROJECTS

Medium Website: https://medium.com/@jacky308082 (Concentration on Data Science and Data Analytics) Preciser - Fantasy Prediction Website October 2019-Present

• Deployed a website for NBA audience to evaluate and estimate players performance based on ten years records (1 million)

• Built a user-based recommendation system for Fantasy Basketball users to rate players and compare total ratings in trades

• Enhanced accuracy by 10% by designing models with Gradient Boosting Machines outperformed models from other websites

• Utilized: Python, D3.js, HTML/CSS/JavaScript, Flask, MySQL, Docker, AWS Customer Revenue in Google Merchandise Store January 2018-March 2018

• Built SVM and Random Forest models for Google to predict customer revenue in each order by developing more than 30+ effective features in automated feature engineering for more than 900k rows of transaction data

• The result had an accuracy of 0.85 and enhanced the effectiveness in analyzing each order

• Utilized: Python, R, Tableau

EDUCATION

Arizona State University, W. P. Carey School of Business, Tempe, Arizona August 2019-May 2020(expected) Master of Science in Business Analytics GPA: 4.00/4.00 National Central University, Taoyuan, Taiwan September 2018-June 2019 Master of Business Administration (M.B.A) GPA: 3.83/4.00 Soochow University, Taipei, Taiwan September 2013-June 2017 Bachelor of Economics GPA: 3.33/4.00

SKILLS

• Programming: SQL, Python (sklearn, XGBoost, H2O, Pandas, Numpy, NLTK, SciPy, Flask), R, HTML/CSS/JavaScript

• Database and Visualization: PostgresSQL, MySQL, MongoDB, Tableau, D3.js, Plotly, Seaborn, Matplotlib, ggplot2, Bokeh

• Distributed Computing and Deployment Tools: AWS (EC2, S3, EMR), Spark (SQL, ML), Hadoop, Hive, Docker, Git



Contact this candidate