Post Job Free
Sign in

Data Assistant

Location:
Newark, CA, 94560
Posted:
July 11, 2017

Contact this candidate

Resume:

Huihui Zhang

Address: **** ***** ******* ***. ******, CA 94560 Email: ***************@*****.*** Tel.: 213-***-**** Summary: In-depth knowledge in logistic regression, linear regression, ANOVA and statistics. 4+ years experience on using various statistical software including SAS, SQL, Python to do data cleaning, analysis, visualization and modeling. Education

University of Southern California (USC) 08/2014-08/2016 Keck School of Medicine of USC (Master of Science in Applied Biostatistics) Coursework: Data Analysis, Design of Experiment, Object Oriented Programming (JAVA), Principle of Biostatistics, Epidemiology, Database System, Statistical Methods. China Agricultural University (CAU) 09/2009-07/2014 College of Veterinary Medicine (Bachelor of Agriculture in Veterinary Medicine) Skills

Analytical Tools: SAS (SAS Base, SAS Macro, SAS SQL), Python (Numpy, Pandas, Scikit-learn), R (ggplot2, caret), STATA Database: SQL, Oracle database, MySQL Server, Hive, Spark, Pig Modeling Skills: Linear Regression, Logistic Regression, Random Forest, Neural Network, Decision Tree, KNN, Perceptron, Support Vector Machine, Clustering

Data Visualization: Spotfire, Tableau, R(ggplot2)

Certification: SAS Certified Advanced Programmer for SAS 9 (SAS, License AP017766v9, Starting July 2016) Work Experience

Business Technology Consultant, Genentech, Inc., South San Francisco, CA 10/2016-Present

Maintain SQL queries with multi-table joins, group functions, sub-queries to extract data from Oracle database and populate tables for daily reporting.

Develop Python scripts using Pandas and Numpy to extract data from database, cleaned data sets that come from clinical trials and perform statistical analysis.

Perform extensive QC (Quality Check) and analysis in reviewing other team members work as well as render primary support and assistance in data validation and data cleaning in all phases of Clinical studies

Generate Cross table, Bar chart, Tree map and complex reports with Spotfire and Tableau to visualize the data. Build Dashboards with filters, parameters and sets in Tableau to handle views more efficiently. Connect Oracle database and Google Spreadsheet to Spotfire and Tableau to keep the visualization up to date. Research Assistant, University of Southern California, Los Angeles, CA 02/2016-06/2016

Compared the prevalence of dispatcher-assisted cardiopulmonary resuscitation (CPR) before and after institution of Los Angeles Fire Department Tiered-Dispatch Card System (LATDS)

Coordinated with the investigators to interpret the analysis results and to develop statistical analysis plan (SAP).

Implemented SAS procedures to generate datasets, executed data cleaning and missing value imputation. Performed statistical analyses including Mann-Whitney test, Cohen's kappa test and logistic regression with SAS procedures: proc import, proc freq, proc logistic, proc univatiate, proc contents, proc npar1way, etc.

Concluded that LATDS significantly improved the speed with which call-takers recruited callers to start chest compressions in cases of out-of-hospital cardiac arrest. Research Experience

Relationship of Blood Pressure and Antihypertensive Medications to Cognitive Change 10/2014-06/2015

Executed data cleaning and missing value imputation using SAS. Built multiple linear regression models to evaluate associations of blood pressure, cognitive change and blood pressure medication by programming in SAS.

Concluded that higher systolic blood pressure and pulse pressure were associated with some aspects of cognitive decline, and that central agonists and diuretics may reduce cognitive decline. Publication

Sanko S G, Zhang H, Lane C, et al. 155 Classification of Bystander CPR Using 911-Call Review Versus Field Report[J]. Annals of Emergency Medicine, 2016, 68(4): S62.



Contact this candidate