Post Job Free
Sign in

Data Python

Location:
Queens, NY
Posted:
June 09, 2020

Contact this candidate

Resume:

XUMING (MARCO) WANG Email : ********@*****.***

Poforlio: http://marcoxm.github.io Mobile : 646-***-**** Jackson Ave 28-10, Jackson Park, Long Island City, 11101 technical skills

Programming Languages: : Python, SQL, Golang, and C++

Data Engineering: AWS( Redshift, EMR, Kinesis, RDS and ), Spark, S3, and AirFlow.

Data Science:: Scikit-Learn, Pytorch, TensorFlow,Sagemaker, SPSS, and SAS.

DevOps:: AWS Cloudformation, Jenkins, Circleci, Git Docker, and Kubernetes.

Business Intelligence Tool:: Tableau; Spot re, QlikView, Altery, Matplotlib, Seaborn, ggplot

Kaggle Competition: University of Liverpool - Ion Switching - Bronze Medal top 8% (192/2,618) education

Fordham University New York City, NY

Master of Science in Business Analytics, STEM GPA:3.80 Aug. 2018 { Dec. 2019

Guangdong University of Foreign Studies Guangzhou, China Bachelors Degree in Logistics( Supply Chain Management); Aug. 2011 { July. 2015 experience

Wright Star LLC New York City, NY

Financial Data Analyst Oct 2019 - Present

Portfolio Management: Automated the data updating process for data ETF and stock model using Python. Improved the overall e ciency by enabling the models to update the daily closing price of stocks automatically

Quantitative Modeling: Built a new nancial model using Excel and Python, which classi es stocks based on multiple nancial indicators, and provide insights to traders in nding valuable long-term stock investments

DHL-Sino Trans International Air Courier LTD Dongguan, Guangdong, China Operation Analyst Jan 2017 - Aug 2018

Analytics Report: Updated data process by using Python and Excel.identi ed root causes and develops potential solutions, Improved the overall customer satisfaction score 15% by using root cause analysis, and delivered actionable solution plan.

Project Management: Leads corss-department team and participates in operation e ciency improvement project. Communicated and mined the needs of each department. Applied insights to the development, execution, and improvement of action plans.

projects

Lending Club Loan Interest Rate Prediction: New York City, NY Aug. 2019 { Dec. 2019

Built machine learning models to predict the risk of loan, based on 2019 Q1-Q3 Loan raw data with over 800,000 records and 150 features

Performed missing value imputation, feature engineering and feature selection through exploratory analysis

Validation; Predicted on test set, with F1 of and 0.98 by GBDT and logistics refression and 0.98/1 by Lightbgm

Web Analytic on Airbnb Reviews: New York City, NY Sep. 2018 { Dec. 2018

Pre-processed review data with 100000 reviews of Manhattan 2018 crawled from internet, lled missing value, grouped by countries, gender or age to get the statistic information.

Identi ed the Sentiment of guests to the house. By using NLP model to generate the sentiment score for each house by region or countries.

Used clustering model to gure internal relationship between fearures and review score and conducted into visualization

Patient Readmitted Prediction: New York City, NY Sep. 2018 { Dec. 2018

Pre-processed data with 100000 records of diabetes patients by conducting EDA, data cleaning and implementing appropriate feature encoding with Pandas, Numpy and Scikit-learn libraries

Written pipeline to conduct features selection ( lter/ wrapper) and ensemble methods for classi ers including soft-margin SVM, Random Forest, Perceptron, Logistics Regression and Naive Bayes



Contact this candidate