Xingmin (Jay) Bao
San Jose, CA, *****@***.***, 949-***-****
Summary
* *****+ academic research experience and 3 years+ Business/Data analytics experience with in-depth knowledge of Business intelligence, Data Analytics, Data Engineering, Machine learning, and Statistical Modeling. Aiming to utilize strong cutting-edge analytics skills to help the company achieve its long-term goals.
Languages, Skills and Tools
Python (Pandas, Scikit-learn, Numpy), R, Machine Learning (Supervised Learning, Unsupervised Learning), SQL, Big Data (PySpark), Data Visualization (Tableau, Ggplot2, Seaborn, Matplotlib), Google Analytics, MS Excel, A/B Testing
Work Experience
EPlanet Capital (Private Equity) San Jose, CA
Role: Data Analyst Intern January 2019 – January 2020
Conducted machine learning and multivariate regressions to develop stock selection recommendation system, achieving 10% more accurate rate of buy and short recommendation and 20% more average portfolio return than previous system
Partnered with team of 4 people to apply Python API and SQL queries to automatically import company filings from external (SEC edgar, wrds) and internal sources
Researched and evaluated emerging data tools and techniques; Founded and selected 20 more variables that have significantly influence on stock return based on statistical hypothesis test result
UC Berkeley, Haas School of Business Berkeley, CA
Role: Research Data Analyst August 2019 - December 2019
Extracted, cleaned, aggregated, and manipulated required data from Dow Jones database
Employed new and existing analytical models to support hypotheses and finance theories; designed and interpreted analyses
ZGC Innovation Center (Consulting Firm) Santa Clara, CA
Role: Business Analyst Summer Intern June 2019 – August 2019
Used Erwin-modeling for reverse engineering and hosting according to the business requirements on existing models
Increased traffic and member subscriptions by 20% through email marketing based on customer clustering (K-means)
Tracked and performed exploratory data, successfully interpreted via Python and Tableau to identify business improvement trends and draw conclusions for managerial strategy
Granada Cabinet Import Company Orange County, CA
Role: Supply Chain Analyst January 2018-August 2018
Minimized supply chain risks and developed alternatives that assure consistent flow of materials and product
Investigated time series trending patterns and seasonal patterns in R for each month and quarter to predict market demands
Education
Santa Clara University August 2018 - January 2020
Master of Science in Business Analytics STEM
University of California Irvine September 2013 - June 2017
Bachelor of Arts, Business Economics
Projects
Climate Change Prediction - (Time series modeling) February 2020 – April 2020
Preprocessed a 20GB dataset from university e-library using Python SQLite into SQL database; compared RMS error of ARMA, Fourier and Holts model to determine the best predictive model for future domestic temperature
Bank Fraud Detection September 2019 – December 2019
Built a data pipeline using Python and SQL; utilized Logistic Regression, Decision Trees, and SGDClassifier to develop a better fraud transaction indicator with 95% accuracy and 0.93 AUC
Customer Churn Analysis April 2019 – June 2019
Lead a team of 4 people to do the customer classification and to build the prediction model (GBM and RF) for current subscribers; suggested business strategies to decrease the churn rate
Certifications: Python Foundation Certification from Codecademy, SQL Certification from Udemy, Finance/Accounting Certification from Irvine Valley College
Working Visa: Green Card holder