Post Job Free
Sign in

Data Project

Location:
New York, NY
Posted:
January 10, 2017

Contact this candidate

Resume:

Chuhan Yang

*** **** *** ******, *** *F, New York, NY 10027 • ******@********.*** • 917-***-****

EDUCATION

Columbia University, Graduate School of Arts and Sciences, New York, NY September 2015-December 2016 MA in Statistics Cumulative GPA:3.63

Relevant Coursework: Data Mining, Time Series Analysis, Survival Analysis, Bayesian Statistics, Linear Regression Models, Statistical Machine Learning, Advanced Data Analysis Wuhan University, School of Mathematics and Statistics,Wuhan,China September 2011-June 2015 BS in Mathematics Base GPA: 3.3/4.0

Relevant Coursework: Mathematical Analysis, Mathematical Modelling, Complex Analysis, Functional Analysis, Real Analysis, Number Theory and Coding Theory

SKILLS

R, Matlab, Python, Tableau, MySQL(basic), Julia(basic) WORK EXPERIENCE

Gizwits, Guangzhou, China 2016.07-2016.08

Algorithm Engineer Intern

Research on anomaly detection algorithm in the AI team for Gizwits showcase.

Updated parameter-free and window comparison anomaly detection algorithm by SAX transformation and window size determination, which successfully found out NY taxi data’s top 5 anomaly periods due to festival or weather.

Performed data cleaning and exploratory data analysis on Huashang Sanyou charging piles data in prepare of project negotiating.

Transport Planning Research Institute, Guangzhou, China 2016.06 Megalopolitan Coordinate Analysis Project

Gathered various related data of 38 megalopolises, perform dimension reduction for visualization requirements.

Solved the interpretation problem by performing hierarchical clustering, produced dendrogram which provides reasonable reference direction for Guangzhou’s future development.

Update original model by abandoning entropy weight method, using sparse PCA and correlation-based distance, and organizing all the results for final analysis report. Urban Transportation Research Center, Beijing, China 2013.07-2013.08 TOD Project

Participated in the formulation of the transportation development and construction plan of Kunming, Yunnan Province.

Predicted passenger flow in PT Hub in order to provide basis for rationally specifying construction scale of the public transport hub and transport connection means.

Assisted in making public transportation model of Kunming, Yunnan Province. EXTRACURRICULAR EXPERIENCE

Advanced Data Analysis Project, Columbia University 2016.10-2016.12

Exploratory data analysis on water table dataset and produced interactive plots of function status in Tanzania.

Performed multiclass classification using random forests, logistic regression and XGBoost on water table dataset.

Predicted water pumps' functional status in Tanzania, reached an accuracy of 82.45%, ranked top7 among 1800 teams in DRIVENDATA competition.

Steam Game Recommendation Project, Data Application Lab 2016.09-2016.10

Built crawler to extract game inventory of each steam user id and app details using various steam API.

Preprocessed the crawled data, performed data cleaning and feature selection, loaded data into MySQL.

Built recommendation model to present the top 10 recommendation for each steam user using recommendation module from pyspark mllib package.

Director of Publicity Department, Mathematical Modeling Association, Wuhan University 2011.09-2013.03

Organized various activities including Sudoku Contest, Math Gaming Salon and the recruiting of the Association.

Independently designed various posters and leaflets, attracting many students to join in the Association. AWARDS AND HONORS

2016.11, Columbia StatFest 1st Place (2-day hackathon), Columbia Statistics Club & STATS.org 2013-2014, Wuhan University Outstanding Student



Contact this candidate