Post Job Free
Sign in

Data Internal Medicine

Location:
Philadelphia, PA
Posted:
December 10, 2012

Contact this candidate

Resume:

Lin Yang

Office address:

**** ******** ****

*** ******** *****

Division of General Internal Medicine

University of Pennsylvania

Philadelphia, PA 19104

Telephone : (215) 573 - 6735

Email: *******@****.***.*****.***

PROFESSIONAL SKILLS

Statistical Analysis/Modeling:

Analytical, results-driven professional with skills in statistical data analysis/s tatistical modeling, SAS, STATA

& R/S-plus programming, regression analysis, nonparametric statistical methods, survival analysis and design

of experiments. Comprehensive knowledge of applied statistical methodologies including:

Regression analysis (univariate and multivariate linear regression, nonlinear regression including

logistic, poisson, negative binomial regression) & linear model (generalized linear model, hierarchical

linear models including mixed effect, random effect and fixed effect model) ;

Survival analysis (Kaplan-Meier method to estimate survivor function, cox regression for proportional

hazords model);

Propensity score analysis in observational study, propensity score matching combined with

Mahalanobis distance matching;

Categorical data analysis, non-parametric statistical methods (bootstrap technique, binomial test, sign

test, Wilcoxon signed rank test, McNeman test, Mann-Whitney test, randomization test, Kruskal-wallis

test, Friedman s test, Kolmogorov test, contigency table, etc.) ;

Model variable selection (forward, stepwise, backward model selection), model assumption diagnosis,

interval estimates and hypothesis testing;

Mixture model of uniform & Beta distribution, ranking techniques in high dimensional data analysis,

i.e., microarray data analysis, and simulation study for plasmode dataset ;

Design of experiments ( randomized blocks, factorial design, latin squares, fractional factorial design).

Statistical Software:

SAS: Import, integrate, and validate data from various raw data sources into SAS, combine da ta sheets,

manipulate and tramsform data, perform data cleaning, dataset linking and analytic dataset formation,

create list, tabular, and summary report, produce plots, pie and bar charts, and implement statistical

procedures using SAS (BASE, STAT, GRAPH, SQL, MACRO) in the UNIX environment;

STATA: Statistical analysis implemented in STATA, including fixed effect and random effect model,

cox proportional hazards model, multilevel mixed-effects logistic regression, negative binomial

regression, etc.;

R/S-plus: One year s programming experience on simulation study using high-dimensional plasmode

data set, bootstrap technique, graphing, and statistical functions;

Data management:

Manipulation of large administrative healthcare data sets (ie. Medicare claims data) using SAS

statistical software in the UNIX environmen.

PROFESSIONAL EXPERIENCE

September 2011 ~ current Biostatistician

Division of General Internal Medicine, UNIVERSITY OF PENNSYLVANIA PHILADELPHIA, PA

Projects involved:

Advancing instrumental variable methods in comparative effectiveness research;

PCMH(Patient-centered Medical Homes) and ACO(Accountable Care Organizations) external

validation study.

Major tasks and Accomplishments:

Manipulation and management of large administrative healthcare datasets such as Medicare claims data

or insurence claims data, perform data cleaning, dataset linking and creation of analytic datasets;

Implement statistical analyses by using SAS & STATA statistical packages, write program in

SAS/STATA for :

Nonbipartite matching and its statistical applications in SAS/IML software and STATA mata

language

Univariate and multivariate linear or non-linear regression, including least square regression,

logistic, poisson, negative binomial regression

Propensity score compuation, and propensity socre matching combined with Mahalanobis

matching

March 2008 ~ August 2011 Programmer Analyst

Division of General Internal Medicine, UNIVERSITY OF PENNSYLVANIA PHILADELPHIA, PA

Projects involved:

Implications of cardiovascular technology diffusion among Medicare beneficiaries;

Comparative effectiveness of cardiovascular technologies and Medicare cost growth: 2001 -2008;

In-hospital cardiac arrest (IHCA) rates in US hospitals.

Major tasks and Accomplishments:

Manipulation and management of large administrative healthcare datasets such as Medicare claims data,

perform data cleaning, dataset linking and creation of analytic datasets;

Implement statistical analyses by using SAS & STATA statistical packages, write program in

SAS/STATA for :

Generalized linear model, Mixed effect, random effect and fixed effect regression for both

continuous or binary dependent variable

Kaplan-Meier estimator of survivor function, cox regression in survival analysis

Univariate and multivariate linear or non-linear regression, including least square regression,

logistic, poisson, negative binomial regression

Propensity score compuation, and propensity socre matching combined with Mahalanobis

matching

Provide internal reports indicating the results of the computational tasks, draft plots(pie or bar chart,

bubble plots, Venn diagrams, SAS maps, etc.) for manuscripts.

August 2006 ~ August 2007 Graduate Research Assistant

Department of Mathematics & Statistics, UNIVERSITY OF MISSOURI-ROLLA ROLLA, MO

Projects involved:

A comparison of ranking techniques and FDR (False Discovery Rate) approach for the analysis of

microarray gene expression data;

Evaluating statistical methods by simulating high-dimensional plasmode data sets;

Major tasks and Accomplishments:

Write progam in R/S-plus in computing the rank of each gene based on t-statistic ( or p-value) and the

bootstrap estimates for probability of gene selection;

Write progam in R/S-plus in computing FDR(False Discovery Rate), TP(True Positive) for the

microarray gene expression data using the mixture model approach;

Write program in R/S-plus to do simulasion study using plasmode data set;

Write program in R/S-plus to produce plots for the results of comparing statistical methods using

simulated high-dimensional plasmode data set;

1997-2001 Civil Engineer

BUILDING DESIGN & RESEARCH INSTITUTE OF ZHEJIANG UNIVERSITY OF

TECHNOLOGY HANGZHOU, CHINA

Designed various building structures by using PK-PM CAD Structural Software

Provided AutoCAD drafting for construction

CERTIFICATION

SAS Certified Advanced Programmer for SAS 9

EDUCATION

2005-2007 Master of Science in Applied Mathematics with Statistics Emphasis

UNIVERSITY OF MISSOURI-ROLLA ROLLA, MO

Overall GPA : 4.0 /4.0

Member, American Statistical Association

Research report: Assessing the properties of a ranking technique for detecting differentially

expressed genes from a microarray experiment.

Advisor : Dr. Gary L. Gadbury

Course:

Statistical Data Analysis Regression Analysis

SAS Programming Nonparametric Statistical Methods

Design and Analysis of Experiments Linear Models I & II

Probability and Statistics Mathematical statistics

Advanced Calculus I &II

1993-1997 Master of Engineering in Civil Engineering

ZHEJIANG UNIVERSITY HANGZHOU, CHINA

Overall GPA : 3.31/4.0

1989-1993 Bachelar of Engineering in Civil Engineering

ZHEJIANG UNIVERSITY HANGZHOU, CHINA

Overall GPA : 3.44/4.0

Academic Excellence Scholarships for three consecutive years

Exemplary graduate at Zhejiang University

Exemplary graduate in Zhejiang Province

PUBLICATIONS

1. Raina M. Merchant, Lin Yang, Lance B. Becker, Robert A. Berg, Vinay Nadkarni, Graham Nichol, Brendan

G. Carr, Nandita Mitra, Steven M. Bradley, Benjamin S. Abella, and Peter W. Groeneveld. Variability in

Case-mix Adjusted In-hospital Cardiac Arrest Rates. Medical Care, February 2012 ; 50(2): 124-130.

2. Peter W. Groeneveld, Daniel Polsky, Feifei Yang, Lin Yang, Andrew J. Epstein. The Impact of New

Cardiovascular Device Technology on Health Care Costs. Arch Intern Med, July 25, 2011; 171(14): 1289-

1291.

3. Andrew J. Epstein, Daniel Polsky, Feifei Yang, Lin Yang, Peter W. Groeneveld. Coronary Revascularization

Trends in the United States, 2001-2008. JAMA, May 4, 2011; 305(17) : 1769-1776.

4. Peter W. Groeneveld, Andrew J. Epstein, Feifei Yang, Lin Yang, Daniel Polsky. Medicare's Policy On

Carotid Stents Limited Use To Hospitals Meeting Quality Guidelines Yet Did Not Hurt Disadvantaged.

Health Affairs, Febuary 2011; 30(2) : 312-321

5. Peter W. Groeneveld, Lin Yang, Alexis Greenhut, Feifei Yang. Comparative Effectiveness of Carotid

Arterial Stenting Versus Endarterectomy. Journal of Vascular Surgery, November 2009; 50(5) : 1040-1048.

6. Gary L. Gadbury, Qinfang Xiang, Lin Yang, Stephen Barnes, Grier P. Page, David B. Allison. Evaluating

Statistical Methods Using Plasmode Data Sets in the Age of Massive Public Databases: An Illustration

Using False Discovery Rates. Plos Genetics, June 20, 2008; 4(6): e1000098.



Contact this candidate