Post Job Free
Sign in

Data Analyst

Location:
Atlanta, GA
Posted:
November 15, 2020

Contact this candidate

Resume:

Xiaochu (Shelly) Lin

E-mail: **************@***.*** Tel: 206-***-****

Add: 1198 Kendron Lane, Atlanta, GA, 30329

EDUCATIONAL BACKGROUND

Emory University, Atlanta, GA 08/2019-05/2021

MSPH Biostatistics

Relevant Coursework: Generalized Linear Models, Longitudinal and Multilevel Data Analysis, SAS Programming, R Programming, Machine Learning, Applied Regression Methods

University of Washington Seattle, WA 09/2015-06/2019 BS Biology

Mary Gates Hall Research scholarship

SKILLS

• Proficient in programming languages (R, SAS, SQL, Python, Power BI)

• Deep understanding of probability and statistical inference theory

• Proficient in statistical methodology such as applied regression models, survival analysis, longitudinal and multilevel data analysis, statistical computing, and model building

• Highly skilled in data cleaning, data pre-processing, data mining and data visualization RESEARCH / WORKING EXPERIENCE

Data Analyst Intern

Centers for Disease Control and Prevention 11/2020-present

• Perform analytics and data visualization to identify at-risk populations to reduce COVID-19 spread

• Utilize Power BI and R to create tables and figures using JHU and WHO database to keep communities informed at the city, county, state and national levels

• Produce weekly reports and presentations about COVID-19 cases and vaccine progress Research Assistant

Emory University Department of Biostatistics and Bioinformatics 05/2020-present

• Conduct literature review in the field of machine learning with cutting-edge scRNA-seq clustering techniques

• Pre-process large scRNA-seq data using R and implemented supervised and unsupervised clustering method with R packages including scmap, CHETAH, CellAssign, Garnett, Seurat, SC3 etc.

• Evaluate Adjusted Rand Index (ARI) to assess performance of the classification methods

• Perform data visualization with t-SNE and UMAP plots that better visualize clusters of data by reducing dimensionality which also helps identify issues of unassigned labels Teaching Assistant

Emory University Department of Biostatistics and Bioinformatics 08/2020-present

• Holding office hours, conducting lab sessions, and grading homework for Bios500: Statistical Method. Research Assistant

University of Washington, Chen Lab 06/2018-06/2019

• Project: Identification and characterization of kinases candidates that regulate the ability of self-renewal in rhabdomyosarcoma cancer stem cells

ACADEMIC PROJECT EXPERIENCE

Applied Regression Models

• Applied multiple logistic regression models, GAM, ANOVA and multiple comparison methods such as Scheffé to analyze significant risk factors for Type II diabetes in clinical data

• Performed independent statistical plan and data cleaning, built efficient statistical model with SAS and R which found BMI, glucose level and family history of diabetes to have statistical impact on whether a patient had diabetes Clinical Trials Methodology

• Utilized SAS macros for data analysis in pharmaceutical industry including logistic regression, Cox proportional hazard model, generalized linear model, etc.

• Conducted data pre-processing in NCBI brain data in SAS and utilized SAS macros to automate the process of statistical summary report

• Interpreted and presented scientific reports with team member about statistical difference in survival outcome by different treatments



Contact this candidate