Data Project

Houston, Texas, 77030, United States
October 18, 2016

**** ***** **, *******, **, *****, 713-***-****,

Objective: Seeking a full-time position as a Data Scientist, Analyst, or Statistical Programmer.

Skills: Proficient in Python, R, SAS, SQL/T-SQL, Linux/Unix, Apache Spark, Azure, predictive modeling, and experimental design.

Working and Project Experience:

Data Scientist at I-Click Interactive Beijing, China 5/2014-6/2015

• Used machine learning techniques and statistical analysis to propose optimal digital advertising strategies.

• Constructed predictive models to identify potential customers and improve the efficiency of clients' sponsored advertising. Performed feature engineering and missing-data imputation for the predictive models. Evaluated and predicted the effects of digital advertising.

• Designed, in collaboration with other data scientists, a recommendation system for an online dating website that automatically recommends profiles to users. Proposed an ensemble model consisting of 13 base models.

Data Analyst Intern at China Development Bank Wuhan, China 2/2013-8/2013

• Supported and maintained the customer database. Processed, cleansed, and verified the integrity of data used for analysis. Performed statistical analysis and data visualization on KPIs to produce reports.

• Provided advanced analytical support to China Development Bank using Excel VBA.

Project, Analysis of Web Server Log File, Text Mining (Python, Apache Spark, SQL) 6/2016-8/2016

• Parsed and transformed a text log file from the NASA Kennedy Space Center web server into structured data to maintain user information and detect anomalies. Improved user experience using statistical methods (hypothesis testing).

Project, Loan Granting Binary Classification (R, Azure ML) 7/2016-8/2016

• Predicted whether a loan applicant would fully repay or default on a loan, using a dataset of over 250,000 records with 19 features. Combined classifiers (logistic regression, KNN, random forests) to reach 95% accuracy.

Project, Analysis of Living Condition Survey of Second-Generation Immigrants (SAS) 4/2016-5/2016

• Performed quantitative analysis of the living conditions of second-generation immigrants in the USA based on the American population census database. Built statistical models to identify factors affecting the education and income of the immigrants (ANOVA, regression).

Project, Financial Investment Portfolio based on KPI (R) 11/2015-12/2015

• Performed quantitative and graphical analysis of KPIs based on a database covering 50 years of financial statements for over 15,000 companies on the American stock market. Implemented the Piotroski score method to propose an optimal investment portfolio by identifying top-performing and under-performing stocks.


Rice University, Master of Statistics, Statistical Computing and Data Mining, GPA: 3.52/4.0 9/2015-Expected 12/2016

• Core courses: Data Mining and Statistical Learning, Regression and Linear Models, Probability, Generalized Linear Models and Categorical Analysis, Design and Analysis of Algorithms

Wuhan University of Technology, Bachelor of Information and Computing Science, GPA: 3.89/4.0 9/2011-6/2015

• Core courses: Advanced Algebra, Database Implementation, Data Structures and Algorithms, Numerical Analysis

Certifications:

• SAS Certified Base Programmer for SAS 9. SAS Certified Advanced Programmer for SAS 9.

Publications:

• Yutao Ren, Yichao Zhang, Ke Zhu, Background Extraction and Snow Remove Form Video [C], ISEEIP 2013