Shizhe Zhang
Email: ********@*****.*** Cell: +1-929-***-**** New York, NY Linkedin
EDUCATION
Columbia University, Mailman School of Public Health 05/2025 Master of Public Health in Biostatistics, Certificate in Advanced Epidemiology Relevant courses: Data Science I & II, SAS, Applied Regression I & II, Mathematical Statistics, Analysis of Categorical Data, SQL, Epidemiology (REDCap)
University of Liverpool 06/2023
Bachelor of Science in Genetics
Xi’an Jiaotong-Liverpool University 06/2021
Bachelor of Science in Biological Science
SKILLS
Data Analysis: R, SAS, SQL, Python
Computer: Microsoft Office (Word, Excel, PowerPoint), LaTeX, Adobe Photoshop Languages: Mandarin (native), English (fluent)
RESEARCH AND DATA ANALYSIS PROJECTS
Columbia University, Department of Biostatistics 12/2024-present Research Assistant
Applied Functional Principal Component Analysis (FPCA) using R to uncover key behavioral patterns in mice learning curves, enhancing understanding of learning dynamics.
Executed phase and amplitude separation of learning trajectories, improving interpretation of behavioral variability.
Developed and implemented advanced data transformation techniques to optimize visualization and interpretation of complex learning behaviors.
Implementing several data transformations for improved visualization and interpretation of learning behaviors.
Evaluated and refined multiple smoothing methods to elevate data quality and analytical rigor.
Assisting in grant applications, contributing statistical analysis sections and data-driven justifications for research funding. Capstone Project: Proteomic Analysis of Mice Learning Behavior 01/2025-04/2025
Processed and cleaned publicly available proteomic datasets, ensuring comprehensive handling of missing data and robust preprocessing.
Leveraged machine learning to identify key proteins predictive of learning differences, enabling targeted biological insights.
Conducted logistic regression to elucidate relationships between protein expression and learning outcomes.
Designed data visualizations and synthesized findings into a comprehensive report, supporting methodological transparency. Center for Statistical Genetics, Columbia University Medical Center 09/2024-11/2024 Research Assistant
Led Quantitative Trait Loci (QTL) analysis using R, revealing genetic associations linked to neurodegenerative disorders in brain and blood samples.
Authored and refined comprehensive FUSION methodology protocols, facilitating clear guidance on data processing and interpretation for lab personnel.
Led data cleaning and quantitative analysis on genetic and health outcomes datasets, using advanced statistical programming and data management tools.
Produced polished deliverables including reports, data summaries, and technical documentation for academic and grant audiences.
Optimized preprocessing workflows by implementing rigorous quality control and data transformation techniques, significantly enhancing reproducibility and analytical precision. Comprehensive Study of Biostatistics and Data Analysis 07/2022-10/2022
Participated in a data camp focused on quantitative analysis and statistical programming using R.
Mastered R for advanced data analysis, including regression models, cross-validation, and statistical visualization after data collecting and data cleaning.
Produced a research report on Trisomy 21 Syndrome, leveraging data visualization tools to illustrate key findings.
Developed reproducible analytical workflows, ensuring methodological transparency and replicability for future research. SELECTED INTERNSHIP EXPERIENCE
South Bronx United, New York, NY 06/2024-08/2024
Research Intern
Collected, managed, and analyzed data on greenspace access and health outcomes in underserved, urban neighborhoods.
Utilized R and SAS to conduct statistical comparisons and generate actionable insights for community health initiatives.
Summarized evidence for effective interventions addressing health inequities and presented findings to support public reporting and advocacy.
Synthesized and summarized complex research findings to inform interventions in behavioral health and chronic disease prevention.
Incorporated community feedback into project reports and recommendations to ensure alignment with equity and community needs.