Post Job Free
Sign in

Data Analyst Python

Location:
Jersey City, NJ
Posted:
September 09, 2020

Contact this candidate

Resume:

YUE YANG

New York, NY 646-***-**** *******@***.*******.*** LinkedIn: www.linkedin.com/in/yue-yang-172099195 EDUCATION

Weill Graduate School of Medical Science, Cornell University New York, NY Master of Science in Health Policy and Economics GAP: 3.8/4.0 08/2020

• Major courses: Biostatistics (R), Data Science for Machine Learning, Artificial Intelligence in Medicine (Python), U.S. Healthcare Policy and Delivery, Modern Methods for Causal Inference, Applied Econometrics and Data Analysis Dalian University of Technology Dalian, China

Bachelor of Economics GAP: 3.6/4.0 06/2019

• One-year study in East China Normal University Shanghai, China as Exchange Student (Top 5% Only) SKILLS

• Technical: SQL(PostgreSQL), Python (NumPy, Pandas, Matplotlib), R(dplyr, Shiny, ggplot2), Stata

• Data Analysis: A/B Testing, Statistical analysis, Machine learning (Decision Tree, SVM)

• Commercial: Tableau, MS Excel(Array formulas, text manipulation), ACCA(Association of Chartered Certified Accountants) Advanced Diploma in Accounting and Business RELATED EXPERIENCE

Capstone Project Data Analyst New York, NY

Hospital for Special Surgery(HSS) 01/2020-07/2020

• Coordinated a team of 5 to analyze and evaluate current health care patterns and utilization of Arthroplasty services

• Extracted and merged over 110,000 target patients and hospital clinical reported data from relational databases with SQL and performed data cleaning and exploratory analysis on disparity and quality of surgery with R

• Conducted casual inference to investigate the effect of health policy on surgical outcomes and costs by applying statistical and econometric models

• Developed on-going reports for HSS clients including visualization of trends and interpretation of results to provide insights for decision making on service delivery

Data Analyst intern Shanghai, China

Pharmeyes Health Management Consulting Co.Ltd 06/2018-09/2018

• Implemented an interface in R using Shiny package to test models for the clients’ defect rates

• Performed the standardized Ad-Hoc analysis with HIVE and R-shiny into unified python codes

• Visualized the findings using python(Matplotlib) to present the outcome and predictive powers

• Practiced evaluation for Chinese health markets products and system and independently developed the report for assessing the clients’(including Johnson & Johnson and Smith & Nephew) markets performance ACADEMIC PROJECT

Prediction the need for intubation for COVID-19 patients New York, NY Cornell University, Data Science 06/2020- 07/2020

• Extracted features from the longitudinal vital signs measure (blood pressure, heart rate, SpO2 or blood oxygen levels) to create unique value of each patient for deeper analysis using R

• Built several predicative models including Logistic Regression, Random Frost, SVM to estimate the probability of incubation and optimized the final model with over 80% accuracy

• Conducted data analysis report to communicate with the authorities helping them understand key prediction factors Causality between customer churn and premium New York, NY Cornell University, Causal inference 07/2020-08/2020

• Formulated a structure causal model to investigate the effect of premium technical support service on reducing churn of customers based on IBM sample datasets from Kaggle

• Implemented parametric G-computation, inverse probability of treatment weighted estimator, AIPW, and TMLE to find the causal relationship by comparing the mean average treatment effect, statistical variance, and 95% confidence interval among those different methods

CERTIFICATE

• Intermediate python; Join Data in SQL; Data Science for Business; Healthcare Data Models; Neural Networks and Deeping learning



Contact this candidate