Lin Yang
Office address:
Division of General Internal Medicine
University of Pennsylvania
Philadelphia, PA 19104
Telephone: (215) 573-6735
Email: *******@****.***.*****.***
PROFESSIONAL SKILLS
Statistical Analysis/Modeling:
Analytical, results-driven professional with skills in statistical data analysis/statistical modeling, SAS, STATA
& R/S-plus programming, regression analysis, nonparametric statistical methods, survival analysis and design
of experiments. Comprehensive knowledge of applied statistical methodologies including:
Regression analysis (univariate and multivariate linear regression, nonlinear regression including
logistic, Poisson, negative binomial regression) & linear models (generalized linear model, hierarchical
linear models including mixed effect, random effect and fixed effect models);
Survival analysis (Kaplan-Meier method to estimate survivor function, Cox regression for proportional
hazards model);
Propensity score analysis in observational study, propensity score matching combined with
Mahalanobis distance matching;
Categorical data analysis, non-parametric statistical methods (bootstrap technique, binomial test, sign
test, Wilcoxon signed rank test, McNemar test, Mann-Whitney test, randomization test, Kruskal-Wallis
test, Friedman's test, Kolmogorov test, contingency table, etc.);
Model variable selection (forward, stepwise, backward model selection), model assumption diagnosis,
interval estimates and hypothesis testing;
Mixture model of uniform & Beta distribution, ranking techniques in high dimensional data analysis,
i.e., microarray data analysis, and simulation studies for plasmode datasets;
Design of experiments (randomized blocks, factorial design, Latin squares, fractional factorial design).
Statistical Software:
SAS: Import, integrate, and validate data from various raw data sources into SAS, combine data sheets,
manipulate and transform data, perform data cleaning, dataset linking and analytic dataset formation,
create list, tabular, and summary reports, produce plots, pie and bar charts, and implement statistical
procedures using SAS (BASE, STAT, GRAPH, SQL, MACRO) in the UNIX environment;
STATA: Statistical analysis implemented in STATA, including fixed effect and random effect model,
Cox proportional hazards model, multilevel mixed-effects logistic regression, negative binomial
regression, etc.;
R/S-plus: One year's programming experience on simulation studies using high-dimensional plasmode
data sets, bootstrap technique, graphing, and statistical functions.
Data management:
Manipulation of large administrative healthcare data sets (e.g., Medicare claims data) using SAS
statistical software in the UNIX environment.
PROFESSIONAL EXPERIENCE
September 2011 ~ current Biostatistician
Division of General Internal Medicine, UNIVERSITY OF PENNSYLVANIA PHILADELPHIA, PA
Projects involved:
Advancing instrumental variable methods in comparative effectiveness research;
PCMH (Patient-Centered Medical Homes) and ACO (Accountable Care Organizations) external
validation study.
Major tasks and Accomplishments:
Manipulation and management of large administrative healthcare datasets such as Medicare claims data
or insurance claims data, perform data cleaning, dataset linking and creation of analytic datasets;
Implement statistical analyses by using SAS & STATA statistical packages, write programs in
SAS/STATA for:
Nonbipartite matching and its statistical applications in SAS/IML software and STATA Mata
language
Univariate and multivariate linear or non-linear regression, including least squares regression,
logistic, Poisson, negative binomial regression
Propensity score computation, and propensity score matching combined with Mahalanobis
matching
March 2008 ~ August 2011 Programmer Analyst
Division of General Internal Medicine, UNIVERSITY OF PENNSYLVANIA PHILADELPHIA, PA
Projects involved:
Implications of cardiovascular technology diffusion among Medicare beneficiaries;
Comparative effectiveness of cardiovascular technologies and Medicare cost growth: 2001 -2008;
In-hospital cardiac arrest (IHCA) rates in US hospitals.
Major tasks and Accomplishments:
Manipulation and management of large administrative healthcare datasets such as Medicare claims data,
perform data cleaning, dataset linking and creation of analytic datasets;
Implement statistical analyses by using SAS & STATA statistical packages, write programs in
SAS/STATA for:
Generalized linear model, mixed effect, random effect and fixed effect regression for both
continuous and binary dependent variables
Kaplan-Meier estimator of survivor function, Cox regression in survival analysis
Univariate and multivariate linear or non-linear regression, including least squares regression,
logistic, Poisson, negative binomial regression
Propensity score computation, and propensity score matching combined with Mahalanobis
matching
Provide internal reports indicating the results of the computational tasks, draft plots (pie or bar charts,
bubble plots, Venn diagrams, SAS maps, etc.) for manuscripts.
August 2006 ~ August 2007 Graduate Research Assistant
Department of Mathematics & Statistics, UNIVERSITY OF MISSOURI-ROLLA ROLLA, MO
Projects involved:
A comparison of ranking techniques and FDR (False Discovery Rate) approach for the analysis of
microarray gene expression data;
Evaluating statistical methods by simulating high-dimensional plasmode data sets;
Major tasks and Accomplishments:
Write programs in R/S-plus to compute the rank of each gene based on t-statistic (or p-value) and the
bootstrap estimates for probability of gene selection;
Write programs in R/S-plus to compute FDR (False Discovery Rate) and TP (True Positive) for the
microarray gene expression data using the mixture model approach;
Write programs in R/S-plus to run simulation studies using plasmode data sets;
Write programs in R/S-plus to produce plots for the results of comparing statistical methods using
simulated high-dimensional plasmode data sets.
1997-2001 Civil Engineer
BUILDING DESIGN & RESEARCH INSTITUTE OF ZHEJIANG UNIVERSITY OF
TECHNOLOGY HANGZHOU, CHINA
Designed various building structures by using PK-PM CAD Structural Software
Provided AutoCAD drafting for construction
CERTIFICATION
SAS Certified Advanced Programmer for SAS 9
EDUCATION
2005-2007 Master of Science in Applied Mathematics with Statistics Emphasis
UNIVERSITY OF MISSOURI-ROLLA ROLLA, MO
Overall GPA: 4.0/4.0
Member, American Statistical Association
Research report: Assessing the properties of a ranking technique for detecting differentially
expressed genes from a microarray experiment.
Advisor: Dr. Gary L. Gadbury
Courses:
Statistical Data Analysis Regression Analysis
SAS Programming Nonparametric Statistical Methods
Design and Analysis of Experiments Linear Models I & II
Probability and Statistics Mathematical Statistics
Advanced Calculus I & II
1993-1997 Master of Engineering in Civil Engineering
ZHEJIANG UNIVERSITY HANGZHOU, CHINA
Overall GPA: 3.31/4.0
1989-1993 Bachelor of Engineering in Civil Engineering
ZHEJIANG UNIVERSITY HANGZHOU, CHINA
Overall GPA: 3.44/4.0
Academic Excellence Scholarships for three consecutive years
Exemplary graduate at Zhejiang University
Exemplary graduate in Zhejiang Province
PUBLICATIONS
1. Raina M. Merchant, Lin Yang, Lance B. Becker, Robert A. Berg, Vinay Nadkarni, Graham Nichol, Brendan
G. Carr, Nandita Mitra, Steven M. Bradley, Benjamin S. Abella, and Peter W. Groeneveld. Variability in
Case-mix Adjusted In-hospital Cardiac Arrest Rates. Medical Care, February 2012; 50(2): 124-130.
2. Peter W. Groeneveld, Daniel Polsky, Feifei Yang, Lin Yang, Andrew J. Epstein. The Impact of New
Cardiovascular Device Technology on Health Care Costs. Arch Intern Med, July 25, 2011; 171(14): 1289-1291.
3. Andrew J. Epstein, Daniel Polsky, Feifei Yang, Lin Yang, Peter W. Groeneveld. Coronary Revascularization
Trends in the United States, 2001-2008. JAMA, May 4, 2011; 305(17): 1769-1776.
4. Peter W. Groeneveld, Andrew J. Epstein, Feifei Yang, Lin Yang, Daniel Polsky. Medicare's Policy On
Carotid Stents Limited Use To Hospitals Meeting Quality Guidelines Yet Did Not Hurt Disadvantaged.
Health Affairs, February 2011; 30(2): 312-321.
5. Peter W. Groeneveld, Lin Yang, Alexis Greenhut, Feifei Yang. Comparative Effectiveness of Carotid
Arterial Stenting Versus Endarterectomy. Journal of Vascular Surgery, November 2009; 50(5): 1040-1048.
6. Gary L. Gadbury, Qinfang Xiang, Lin Yang, Stephen Barnes, Grier P. Page, David B. Allison. Evaluating
Statistical Methods Using Plasmode Data Sets in the Age of Massive Public Databases: An Illustration
Using False Discovery Rates. PLoS Genetics, June 20, 2008; 4(6): e1000098.