JUAN FANG
San Diego, CA 92122
***********@*****.***
OBJECTIVE
Seeking a full-time data analyst position where I can contribute my statistical analysis skills.
SKILLS
Statistical Skills
Data Analysis: Survival and Categorical data analysis including Life-table, Kaplan-Meier, Log Rank, Cox model,
Accelerated Failure Time model, Generalized linear model, Pearson Chi-Square test, contingency tables.
Data Mining: Experienced in cluster, discriminate and classification analysis including LDA, QDA, MDA, KNN,
CART, neutral networks and multidimensional scaling.
Multivariate analysis: Principal Component analysis (PCA), independent component, sufficient dimension
reduction, Comparison of Multivariate Means, factor analysis.
Experimental Design: ANOVA, Mixed model analysis, Regression analysis, nested design, CRD, RCBD, split-
plot, Latin Squares, Graeco-Latin Squares.
Technical Skills
SAS Tools: SAS/BASE, SAS/GRAPH, SAS/MACROS, SAS/ODS, SAS/REPORTS, PROC IML, PROC
FORMAT, PROC SQL, PROC REPORT, PROC TABULATE, PROC GLM, PROC REG, PROC LOGISTIC,
PROC MIXED, PROC LIFETEST.
Database Management: SAS, Excel and Access.
Other Software: R, JMP, SAS Enterprise Miner, Microsoft suits.
Platforms: LINUX, UNIX, Windows.
WORKING EXPERIENCE
Data Analyst Research Assistant August 2009-May 2010
Biodiversity and Spatial Information Center, Raleigh, NC
Conducted application programming, analysis, modeling and reporting.
Performed data cleaning including extracting, merging and transforming via Excel and SQL.
Implemented data mining technique (K-means cluster analysis, classification) to identify groups within
data.
Provided data queries using SAS/SQL and presented analyzing reports.
Gained experienced in working with people (biologists and GIS analysts) to ensure the project processing.
Statistical Consultant January 2010-May 2010
Statistics Department in NCSU, Raleigh, NC
Served as consultant to communicate with clients and provide statistical skills to meet their goals.
Evaluated and implemented logistic regression to fit the Canine Spinal Cord Injury categorical data.
Established significance tests (Wald test and Score test) to verify model validity.
Implemented power analysis and sample size calculation for clinical experiment.
Helped to write statistical analysis report for clients’ veterinary paper writing.
Statistician Intern May 2009 - July 2009
National Center for Atmospheric Research, Boulder, CO
Designed the statistical experiment to determine the accuracy of two interpolation methods: bilinear
interpolation and fast thin plate spline.
Established regression analysis to investigate the statistical uncertainty when interpolating random surfaces
forward and backward.
Expanded my knowledge of general linear model in quantitative research.
Learned how to implement R packages to conduct statistical analysis and generate meaningful plots.
CERTIFICATES
SAS Certified Advanced Programmer 2010
SAS Certified Base Programmer 2009
EDUCATION
Master of Statistics May 2010
North Carolina State University, Raleigh, NC
GPA: 3.84/4.0
Bachelor of Electronic Engineering June 2006
Huazhong Univ. of Sci. and Tech., Wuhan, China
GPA: 3.66/4.0
Related Courses: Experimental Statistics For Biological Sciences, Design Of Experiments, Statistical Process
Control, Applied Data Mining, Applied Time Series Analysis, Econometrics, Statistical Consulting, Multivariate
Statistical Analysis, Longitudinal Data Analysis, Categorical Data Analysis, Analysis of Survival Data.
ACADEMIC PROJECTS
Road Signs Estimation for the Third Congressional District of Raleigh 2009
Implemented sampling methods to estimate the road signs and analyzed the data in SAS procedures.
Introduced one-stage cluster sampling design with unequal cluster size to obtain the estimations which will
potentially save more than thousands of dollar for North Carolina Department of Transportation.
Created road density of Raleigh city council district via Geographic Information System to save the time of
getting road clusters.
Statistical Forecasting Studies for New Privately Owned Housing Units Started Data 2009
Performed time series forecasting analysis for economic historical data.
Implemented Augmented Dickey-Fuller Unit Root method to conduct the hypothesis testing for time series
statinarity.
Conducted model identification using ACF, PACF and IACF and verify model validity through Q statistics
(white noise), SCAN, MINIC and ESACF.
Provided forecasting for housing starts for July, August and September of 2008 to accomplish forecasting
accuracy analysis by comparing regression and ARIMA models.
Monitor process variability for individual observations 2009
Conducted quality control analysis literature search to learn the latest variability monitoring methods.
Generated data simulation for applying moving range(MR), exponentially weighted moving average moving
range control charts (EWMAMR), exponentially weighted root mean square charts (EWRMS) to monitor the
variability for individual observations using R.
Worked independently to perform quality control statistical analysis skills.
Spectral analysis for weekly Treasury Constant Maturity Rate data 2008
Performed time series spectral analysis to the 1-year and 10-year maturities time series.
Used nonparametric spectral estimation procedure to produce the estimated spectral density.
Estimated the cross spectrum and calculated the coherent frequencies between two time series.
Fitted and compared ARFIMA model and ARIMA model to the detrended series.
Investigated GARCH behavior and fitted proper GARCH model to the time series.
Effect of Meteorological Variables on the Particulate Concentration Of NO 2 in Olso, Norway 2008
Applied a generalized additive model to explore the relationship between the concentration of air pollutant
and meteorological variables.
Performed data transformation and collinearity analysis.
Computed standardized residuals and influence statistics to find potentially influential observations.
ACTIVITIES
Professional Activities 2008-present
Attended 2009 Jointed Statistical Meeting (JSM) in Washington DC.
Active member of America Statistical Association (ASA).
Other Experience
Member of Sunny Chinese Dance Performing Arts Group, Cary NC 2007- present
Performed traditional Chinese Dance (Peacock Lake) at Chinese New Year Gala in UNC.
Performed in the Dragon Boat festival at GlaxoSmithKline in 2009.
References available upon request.