Y un Li
** ******* **** **** *** Brunswick, NJ **901 732-***-**** *****.*@*****.***
PROFESSIONAL SUMMARY
More than eight years of total experience in handling data with data integrity during extraction,
manipulation, processing, analysis and storage.
Strong knowledge in advanced analytics and data mining techniques.
Working experience in R, SAS/BASE, SAS/STAT, SAS/MACRO, SAS/SQL, UNIX and Windows
environments.
Strong skills in developing SAS programs for data analysis and working with analysis of variance
(ANOVA), regression analysis, linear models, multivariate analysis, predictive solution models including
linear, multiple, polynomial and nonlinear regression models.
Processed large datasets for data transformation including data cleansing, data scrubbing and applying
business logic rules to incoming data.
Expertise in advance SQL programming for joining multiple tables, sorting data, creating SQL views,
creating indexes, campaign management and metadata analysis.
Good command in using R, SAS/BASE, SAS/STAT, SAS/MACRO, SAS/SQL, SAS/ODS, SAS/GRAPH,
Excel, MS Office tools.
Created SAS programs to generate new datasets from raw data files imported and modified existing datasets
using SQL set, merge, sort and formats.
Good command in SAS procedures, reporting and summary procedures, generating reports employing
various SAS procedures like PROC PRINT, PROC REPORT, PROC PLOT, PROC CHART, PROC SQL,
PROC SUMMARY, PRO FREQ, PROC TABULATE, PROC MEANS, PROC UNIVARIATE, PROC
FORMAT and PROC TRANSPOSE.
Excelled at identifying, developing and using strengths of team members, as well as locating, detecting and
resolving problems and weaknesses of each team individual.
Meet with customers to determine their needs, gather and document requirements, communicate with
customers throughout the development project to manage customer expectations, resolve issues and provide
project status.
Coordinates project development and implementation activities.
Query reports using SQL.
Design, coding, testing, debugging and documentation of multiple treatment. Prepare weekly, monthly
reports by PowerPoint, Excel, pivot tables and pivot charts, etc. Present findings and recommendations to all
levels of senior management.
Develop SAS macros for automating the analysis, process and report generation.
Excellent programming skills in developing programs to increase the performance and efficiency of SAS
programs and importing, extracting and loading the data into various servers like SQL server, DB2 and
Oracle.
EDUCATION
M.S., Statistics Rutgers University, New Brunswick NJ, Oct. 2014, GPA: 3.67
Ph.D., Environmental Sciences Rutgers University, New Brunswick NJ, May. 2014, GPA: 3.67
B.E., Environmental Engineering Tianjin University, Tianjin, P. R. China, Jun. 2004, GPA: 3.20
CORE COMPETENCIES
R, RStudio, SAS/STAT, SAS/BASE, SAS/SQL, SAS/MACRO, SAS/ODS, Neo4j, Excel, UNIX
Regression Principal component analysis Factor analysis Clustering
Classification Nonparametric analysis Matrix Factorization Stochastic
PROFESSIONAL EXPERIENCE
Revenue and Profit Management Walt Disney Company 2015 - Present
Decision Science Professional Intern Project
Cluster analysis for guests experience in Walt Disney Theme Parks.
Merchandize missing data replacement analysis using matrix factorization.
Specialist knowledge in machine learning, data visualization, statistical modeling, data mining, or
information retrieval.
Strong data extraction and processing, using Graph Database and Neo4j as well as proficiency in
analysis with R and SAS.
Strong ability to implement, maintain and troubleshoot big data infrastructure, such as machine
learning, conceptual modeling, statistical analysis, predictive modeling, and hypothesis testing.
Experience working with IT strategy teams, business teams and business analysts to define
information systems, services and management.
Department of Statistics and Biostatistics Rutgers University 2013 - 2014
Course Project
Developed factor and cluster analysis for orthopedic material in hospitals; Regression model result
indicated 11 hospitals among 2000 hospitals that could maximize the sales gains.
Derived theoretical conditions for estimation of networks in woman/minority business and small
business data; developed a multi-scale clustering method and designed classification algorithms for
model prediction.
Developed forecasting time series models for hourly temperature data from multiple locations.
Applied Mantel-Haenszel’s method and developed Chi-Square test for heart attack with different
age groups.
Using Newton-Raphson method to estimate parameter in a disease-causing bacteria model.
Created R packages for graphical analysis function for general bio-data plots.
Department of Environmental Sciences Rutgers University 2006 - 2012
Graduate Assistant
Data collection, data manipulation, nonparametric analysis, data visualizations
Test/control analysis for DNA and RNA sequence & phylogenetic analysis of bacteria.
Test/control analysis for chemical instrument analysis including HPLC/GC/IC.
Provided weekly project report and Powerpoint presentations to teams in department labs.
Third place student research contest, sponsored by Geosyntec Consultants.
Department of Environmental Science and Engineering Tianjin University, China 2003 - 2004
Undergraduate Thesis
Linear regression analysis of water supplies in the complex regional systems in Hengshui city,
Hebei Province, China.
ADDITIONAL EXPERIENCE
Teaching Assistant, Department of Environmental Sciences, Rutgers University, NJ 2011-2012
Graduate Assistant, Department of Environmental Sciences, Rutgers University, NJ 2007-2011
CERTIFICATIONS
SAS® Advanced Programming Certification, earned August 27, 2014
SAS® Base Programming Certification, earned August 20, 2014