Yingjie LI
actdis@r.postjobfree.com +1-530-***-**** ***0 Wake Forest Dr Apt 208, Davis, CA, 95616
SKILLS:
Specialties: Data analysis, Business analysis, Statistical Investigation and Analysis, Multivariable Statistics, Macroeconomics Statistical Analysis, Modeling, Machine Learning
Programming Skills: MS office, SAS, EVIEWS, SPSS, R Language, SQL, Python
PROJECTS
Working with "Big Data" 05/2015-09/2015
Used three approaches (see below) to compute the deciles of the total amount less the tolls and make some simple regression models with the New York taxi data (>12G)
The R Programing Environment: Divided a large collection of observations into smaller chunks by just reading/processing chunks of the observations sequentially with read.csv and sapply functions
The UNIX Shell: Used wc, cut and grep commands to cut the useful columns and appended each of the taxi files to a new file. Then, invoked shell commands from R and read the resulting output back into R
Used Parallel Computing: Wrote a Parallelizable Loop with foreach. Then used the Split-Apply-Combine Approach for better performance and applied Split-Apply-Combine to find the answers
The Biham-Middleton-Levine Traffic Model 04/2015-08/2015
Wrote three versions of function groups and plot the GIF with R to simulate this process and found where the bottlenecks are occurring
Found the fastest function group and tried to find out how to improve the running time: by using vectorization.
Implemented the BML model with C by using ‘for’ loops and compared the running time for that with R: C could run much faster than R
Created R packages which contains my functions, help files and anything else
Abalone dataset exploration and statistical inference 12/2014-02/2015
Applied statistics methods to obtain a brief image of the data set
Utilized best subsets algorithms and stepwise regression procedure to select the best model
Checked model validation including internal and external validation
Constructed 99% confidence intervals and prediction intervals under the Model
The degree of social integration of migrants analysis 02/2014-08/2014
Did questionnaire survey in various regions of Zhejiang Province
Collated data and found the major variables which would be meaningful explained
Used principal component analysis to find the principal components which explains 85% or more of the proportion of variation
Used factor analysis and did factor rotation to find the common factors and name after them
Estimated factor scores and found out which variables get the highest score
WORK EXPERIENCE
Analyst Yong’an Insurance Company Hangzhou 06/2013-09/2013
Conductd comprehensive analysis and evaluations of business needs
Prepared customized reports to help business partners by providing a strong competency in understanding risk and accordingly performed better customer selection and customer satisfaction
Involved in credit risk assessment model to calculate risk factor for individual clients based on hierarchy
Experienced with data mining models in finding the sequential trend of the data
Performed updating data by weekly and monthly and maintained, manipulaed the data for database management
EDUCATION BACKGROUND
University of California Davis Graduated: December 2015
Master of Statistics
Main courses: Analysis Categorical Data, Statistical Computing, Statistical Methods for Research
Machine Learning, Data Mining, Statistical Programming
Zhejiang Gongshang University Graduated: June 2014
Major: Economics Statistics
Main courses: Multivariable Statistics, Time Sequence Analysis, Macroeconomics Statistical Analysis, Statistical Forecasting and Decision, Statistical Investigation and Analysis
HONORS
2011 Won a prize for excellence of the Fourth “Poll Cup” Statistical Investigation Conceptual Design Contest in Zhejiang Province
2012 Won the first prize of the university’s “Hope Cup” Mathematical Contest in Modeling Competition