Xufan (Kelsier) Wang
617-***-**** **********@*****.*** linkedin.com/inxufan-wang-5404a1171
Education
Brown University Providence, RI
Candidate for Master of Science in Data Science May 2023 Relevant course: Deep Learning, Machine Learning, Statistics, Regression Analysis, Data Society GPA: 4.0/4.0 Northeastern University Boston, MA
Candidate for Bachelor of Science in Mathematics and Business Administration May, 2021 Honors: Achievement Award for academic excellence, Dean’s list GPA: 3.9/4.0 Skills
Programming: Python (Scikit-learn, Pandas, NumPy), SQL(MySQL), R, Julia, GitHub, Jupyter Notebook, Latex
Data analysis and Visualization: Tableau, Seaborn, Ggplot2
Language Skills: Bilingual in English and Chinese (Mandarin) Employment Held:
Shanghai Information Development Research Association Shanghai, China Data Scientist Intern Jan 2021 – Aug 2021
Utilized Hive to extract summary statistics from the database, built data engineering pipelines to process product type, assets, production equipment as input features to personalized recommendation algorithms
Collaborated with the Product Team to assess and revise features through A/B tests (sample size 40M), evaluated metrics of the possibility to make successful information transformation and check by applying Z-Test and Chi-Squared Test
Deliver a ready-to-use model for automating turbomachinery design that enables people with limited machine learning backgrounds to use the models easily. Deliver reports, visualize results, and explain to non-machine learning background audiences and other collaborative teams HuaAn securities Co Shanghai, China
Data Engineering Analyst Intern Jul 2020 – Dec 2020
Grabbed company yearly report websites to extract desired data and built supervised learning models to investigate the most influential effects in the rare metal industry
Designed and stacked models to predict percentage change of firm’s stock price movement combining SVM, Random Forest, and XGBoost algorithms in Python and achieved R squared value of 0.74 on testing data
Created trading strategy based on model predictions to company and industry future, and conducted back testing on historical data with 2% monthly returns
Grantham, Mayo, Van Otterloo & Co.LLC (“GMO”) Boston, MA Core Data Management Internship Jan 2020-Jun 2020
Coded SQL queries to record and sort data between firm wide systems, multiple reports check to ensure proper data flow
Communicated with other teams including Reconciliation, corporate actions/Pricing, and trade operations, created complex SQL queries to meet their report requests
Project Experience
Data Science Unsupervised Machine Learning Pipeline Project – Prediction of water quality, Brown University
Employed multivariate imputation using python to handle missing values to prepare data for machine learning model
Selected key features from feature engineering to build classification model that estimated the water quality Data Science Quantitative Project – Asset Management Strategies, Brown University
Built time-series model that estimates stock prices of companies in multiple industries using Python, applied cross-validation technique to select ARIMA model with best performance
Applied Bootstrapping technique to get the lowest correlation between different financial products and giving strategies for asset management with lowest risk
Interests:
Fashion Design; Finance News; Stock market; Data Analysis; Swimming; Chess; Music; Alcohol; Entrepreneur