Data Analysis

Location:
Somerset County, NJ
Posted:
November 15, 2015

Resume:

Karen DI HAO

* ****** **, ******** ** ***** Cell: 848-***-**** E-mail: acsguz@r.postjobfree.com

SUMMARY

Proficient in SAS (BASE/Macros/Graphs/STAT/ODS/SQL), R, Excel, Stata, Java, C++, Matlab programming;

Experienced in data analysis on large datasets, predictive modeling, hypothesis testing, and statistical consulting;

Creative, detail-oriented, dedicated statistical analyst and programmer.

EDUCATION & CERTIFICATION

Master of Science: Statistics Rutgers University, New Brunswick NJ 09/2013 to 12/2015

Bachelor of Engineering: Information Engineering Beijing Technology and Business University Beijing 09/2009 to 05/2013

SAS Certified Base Programmer for SAS 9

SAS Certified Advanced Programmer for SAS 9

RELEVANT COURSES

Biostatistics (clinical trials), Data Mining, Time Series, Nonparametric Analysis, Multivariate Analysis, Design of Experiments, Mathematical Inference, Interpretation of Data, Regression Analysis.

EXPERIENCE

Intern, SAS Programmer 06/2015 to current

BDM Consulting, Inc.

Developed and validated SDTM data mapping and SDTM datasets per CDISC standards; Created annotated CRFs and Define.xml for eSubmission; Performed statistical quality assurance review of the SDTM and ADaM datasets, mapping specifications, and tables, listings, and figures. Participated in training interns.

Research Assistant, Program Evaluation Consultant 09/2014 to 03/2015

Technical Consulting & Research, Inc.

Generated 3D pie plots, Likert plots, and histograms to investigate the travel and tourism experience of alumni from different countries; Utilized R to conduct preliminary data analysis, correlation analysis, and CHAID analysis; Wrote report drafts and provided statistical advice for program improvements.

COURSE PROJECTS

Data Mining: Musk Classification (Using R)

Compared several data mining classification methods (LDA, FDA, logistic regression, trees, SVM, boosting, and random forest) to identify the most reliable predictive model. Presented the findings in class and wrote the final report, reporting the test errors of all methods as the selection criterion and choosing an SVM model with specific parameter settings as the final model.

Time Series: Simulated Model for Mortgage Rate (Using R)

Developed an ARIMA model to summarize the behavior of the mortgage rate and produce reliable forecasts. Used R to stabilize the raw data and perform time series analysis; proposed several models and checked the validity of each. Identified the best model based on complexity and rolling forecast errors.

Exploratory Data Analysis: Sales of Orthopedic Equipment (Using SAS)

Generated a list of potential customers to maximize profits. Used SAS to perform data transformations, factor analysis, cluster analysis, and regression trees, and estimated potential gains in sales.

Exploratory Data Analysis: Analyzing Survey Data of Wine Preferences (Using SAS)

Used SAS to run statistical tests such as the chi-square test, Fisher’s exact test, and the Wilcoxon test, exploring patterns of differences (or similarities) between the answers of different groups.

Regression Analysis: Investigation on Cereal Rating Data (Using R)

Constructed a regression model for the cereal data and developed a rating system. Used R for initial data exploration, model selection (forward, backward, and stepwise methods), and model verification (checks for multicollinearity, heteroscedasticity, and normality of residuals).


