XUMING (MARCO) WANG Email : ********@*****.***
Poforlio: http://marcoxm.github.io Mobile : 646-***-**** Jackson Ave 28-10, Jackson Park, Long Island City, 11101 technical skills
Programming Languages: : Python, SQL, Golang, and C++
Data Engineering: AWS( Redshift, EMR, Kinesis, RDS and ), Spark, S3, and AirFlow.
Data Science:: Scikit-Learn, Pytorch, TensorFlow,Sagemaker, SPSS, and SAS.
DevOps:: AWS Cloudformation, Jenkins, Circleci, Git Docker, and Kubernetes.
Business Intelligence Tool:: Tableau; Spot re, QlikView, Altery, Matplotlib, Seaborn, ggplot
Kaggle Competition: University of Liverpool - Ion Switching - Bronze Medal top 8% (192/2,618) education
Fordham University New York City, NY
Master of Science in Business Analytics, STEM GPA:3.80 Aug. 2018 { Dec. 2019
Guangdong University of Foreign Studies Guangzhou, China Bachelors Degree in Logistics( Supply Chain Management); Aug. 2011 { July. 2015 experience
Wright Star LLC New York City, NY
Financial Data Analyst Oct 2019 - Present
Portfolio Management: Automated the data updating process for data ETF and stock model using Python. Improved the overall e ciency by enabling the models to update the daily closing price of stocks automatically
Quantitative Modeling: Built a new nancial model using Excel and Python, which classi es stocks based on multiple nancial indicators, and provide insights to traders in nding valuable long-term stock investments
DHL-Sino Trans International Air Courier LTD Dongguan, Guangdong, China Operation Analyst Jan 2017 - Aug 2018
Analytics Report: Updated data process by using Python and Excel.identi ed root causes and develops potential solutions, Improved the overall customer satisfaction score 15% by using root cause analysis, and delivered actionable solution plan.
Project Management: Leads corss-department team and participates in operation e ciency improvement project. Communicated and mined the needs of each department. Applied insights to the development, execution, and improvement of action plans.
projects
Lending Club Loan Interest Rate Prediction: New York City, NY Aug. 2019 { Dec. 2019
Built machine learning models to predict the risk of loan, based on 2019 Q1-Q3 Loan raw data with over 800,000 records and 150 features
Performed missing value imputation, feature engineering and feature selection through exploratory analysis
Validation; Predicted on test set, with F1 of and 0.98 by GBDT and logistics refression and 0.98/1 by Lightbgm
Web Analytic on Airbnb Reviews: New York City, NY Sep. 2018 { Dec. 2018
Pre-processed review data with 100000 reviews of Manhattan 2018 crawled from internet, lled missing value, grouped by countries, gender or age to get the statistic information.
Identi ed the Sentiment of guests to the house. By using NLP model to generate the sentiment score for each house by region or countries.
Used clustering model to gure internal relationship between fearures and review score and conducted into visualization
Patient Readmitted Prediction: New York City, NY Sep. 2018 { Dec. 2018
Pre-processed data with 100000 records of diabetes patients by conducting EDA, data cleaning and implementing appropriate feature encoding with Pandas, Numpy and Scikit-learn libraries
Written pipeline to conduct features selection ( lter/ wrapper) and ensemble methods for classi ers including soft-margin SVM, Random Forest, Perceptron, Logistics Regression and Naive Bayes