Post Job Free
Sign in

Data Analyst Python

Location:
Cupertino, CA
Posted:
May 14, 2020

Contact this candidate

Resume:

Xingmin (Jay) Bao

San Jose, CA, *****@***.***, 949-***-****

Summary

* *****+ academic research experience and 3 years+ Business/Data analytics experience with in-depth knowledge of Business intelligence, Data Analytics, Data Engineering, Machine learning, and Statistical Modeling. Aiming to utilize strong cutting-edge analytics skills to help the company achieve its long-term goals.

Languages, Skills and Tools

Python (Pandas, Scikit-learn, Numpy), R, Machine Learning (Supervised Learning, Unsupervised Learning), SQL, Big Data (PySpark), Data Visualization (Tableau, Ggplot2, Seaborn, Matplotlib), Google Analytics, MS Excel, A/B Testing

Work Experience

EPlanet Capital (Private Equity) San Jose, CA

Role: Data Analyst Intern January 2019 – January 2020

Conducted machine learning and multivariate regressions to develop stock selection recommendation system, achieving 10% more accurate rate of buy and short recommendation and 20% more average portfolio return than previous system

Partnered with team of 4 people to apply Python API and SQL queries to automatically import company filings from external (SEC edgar, wrds) and internal sources

Researched and evaluated emerging data tools and techniques; Founded and selected 20 more variables that have significantly influence on stock return based on statistical hypothesis test result

UC Berkeley, Haas School of Business Berkeley, CA

Role: Research Data Analyst August 2019 - December 2019

Extracted, cleaned, aggregated, and manipulated required data from Dow Jones database

Employed new and existing analytical models to support hypotheses and finance theories; designed and interpreted analyses

ZGC Innovation Center (Consulting Firm) Santa Clara, CA

Role: Business Analyst Summer Intern June 2019 – August 2019

Used Erwin-modeling for reverse engineering and hosting according to the business requirements on existing models

Increased traffic and member subscriptions by 20% through email marketing based on customer clustering (K-means)

Tracked and performed exploratory data, successfully interpreted via Python and Tableau to identify business improvement trends and draw conclusions for managerial strategy

Granada Cabinet Import Company Orange County, CA

Role: Supply Chain Analyst January 2018-August 2018

Minimized supply chain risks and developed alternatives that assure consistent flow of materials and product

Investigated time series trending patterns and seasonal patterns in R for each month and quarter to predict market demands

Education

Santa Clara University August 2018 - January 2020

Master of Science in Business Analytics STEM

University of California Irvine September 2013 - June 2017

Bachelor of Arts, Business Economics

Projects

Climate Change Prediction - (Time series modeling) February 2020 – April 2020

Preprocessed a 20GB dataset from university e-library using Python SQLite into SQL database; compared RMS error of ARMA, Fourier and Holts model to determine the best predictive model for future domestic temperature

Bank Fraud Detection September 2019 – December 2019

Built a data pipeline using Python and SQL; utilized Logistic Regression, Decision Trees, and SGDClassifier to develop a better fraud transaction indicator with 95% accuracy and 0.93 AUC

Customer Churn Analysis April 2019 – June 2019

Lead a team of 4 people to do the customer classification and to build the prediction model (GBM and RF) for current subscribers; suggested business strategies to decrease the churn rate

Certifications: Python Foundation Certification from Codecademy, SQL Certification from Udemy, Finance/Accounting Certification from Irvine Valley College

Working Visa: Green Card holder



Contact this candidate