Post Job Free
Sign in

Senior Statistician & Data Scientist with 9+ Years Experience

Location:
Menlo Park, CA, 94025
Posted:
December 08, 2025

Contact this candidate

Resume:

Curriculum Vitae

Yousif Alyousifi, Ph.D.

Address: Glencoe Hills Dr, Ann Arbor, MI 48108

Mobile: 313-***-****

Email: ************@*****.*** ********@***.*****.*** Professional Summary

Senior Statistician and Data Scientist with over 9 years of academic and research experience in statistical modeling, machine learning, and deep learning. Demonstrated expertise in developing and validating non-invasive predictive models for liver disease (MASLD, MASH, fibrosis, cirrhosis, HCC, and liver-related mortality) across large-scale cohorts, including UK Biobank, Framingham, MGI, and AllofUs. Skilled in regulatory-grade statistical methods, survival analysis, and advanced machine learning techniques.

Professional Experience

University of Michigan Medical School – Ann Arbor, MI Senior Statistician (Research Fellow) 2023 – Present

• Designed and validated statistical and ML models predicting fatty liver disease outcomes, outperforming current indices.

• Applied R, Python, SQL, and Spark for data cleaning, modeling, and visualization on biobank and clinical datasets.

• Conducted survival/time-to-event modeling (Cox, AFT, C-index bootstrap CI) for liver outcomes, and validate predictive performance.

• Developed risk prediction scores (ESI, MASH-CB, MASLD-PRS), improving early detection and stratification, and implemented transfer learning approaches for non-IID datasets

• Collaborated cross-functionally with clinicians, geneticists, and data scientists to deliver actionable insights for translational medicine.

Universiti Putra Malaysia (UPM) – Selangor, Malaysia Assistant Professor (Statistics) 2021 – 2023

• Taught Statistical Modeling, Inference, and Data Analytics at undergraduate/graduate levels.

• PI of a machine learning grant in air pollution modeling; successfully secured funding.

• Supervised MSc/PhD students; published research in environmental analytics. Universiti Teknologi Malaysia (UTM) – Johor, Malaysia Postdoctoral Researcher (Statistics) 2021

• Built forecasting and Bayesian models for environmental and health risk monitoring.

• Delivered cross-disciplinary insights for policy-driven research. Universiti Teknologi PETRONAS (UTP) – Seri Iskander, Malaysia Instructor / Data Analyst 2018 – 2020

• Developed curriculum in data analytics & applied mathematics using R/Python.

• Applied ML methods to energy/environmental datasets, publishing in peer-reviewed outlets.

Additional Teaching & Analyst Roles 2007 – 2018

• Lecturer at University of Science & Technology (Yemen), Thamar University (Yemen).

• Math & Statistics teacher and institutional data analyst at IMAS (Putrajaya, Malaysia). Education

• Ph.D. Applied Statistics – Universiti Kebangsaan Malaysia (UKM), 2021

• M.Sc. Statistics – UKM, 2017

• B.Sc. Mathematics (First-Class Honors) – Thamar University, Yemen, 2007 Selected Achievements

• Developed ML-derived non-invasive scores for MASLD, MASH, and liver mortality.

• Secured two competitive research grants (UPM, UKM).

• Published 20+ peer-reviewed papers (Springer, Elsevier, IEEE).

• Presented at AASLD, ASHG, Mystat, and global conferences.

• Recipient of UKM Excellent Thesis Award (2021) & Graduate on Time Award. Technical Skills

• Statistical Tools: R, Python, SQL, Spark, Shiny, SPSS

• Libraries: Scikit-learn, Lifelines, Optuna, Tidyverse, Cart, TensorFlow, PyTorch

• ML/AI Expertise: Penalized regression, Random Forest, XGBoost, CatBoost, LightGBM, CNNs,

• Statistical Methods: Bayesian inference, MCMC, GLMMs, time-series, survival analysis, causal inference, multivariate analysis.

• Clinical/Regulatory Experience: Biobank analyses, risk stratification, biomarker evaluation, longitudinal modeling

• Visualization: ggplot2, matplotlib, tidyplot, seaborn, plotly

• Methods: Cross-validation, bootstrap, decision curve analysis, SHAP, feature engineering.

• AI: Experience with training and inference of large-scale AI models such as Large Language Models (LLMs), multimodal models, and reasoning models.

• Collaboration: Statistical consulting, cross-functional teamwork, mentoring diverse trainees Key Projects

Clinical & Genetic Risk Modeling for Liver Disease

• Built multi-cohort ML models for MASLD/MASH detection, fibrosis prediction, and mortality risk.

• Applied transfer learning across UKBB, FHS, MGI and AllofUS cohorts.

• Delivered results at AASLD/ASHG 2023–25; manuscripts under review at Journal of Hepatology and Liver International.

Stochastic & Bayesian Models for Air Pollution

• Developed spatio-temporal Markov chain and Bayesian frameworks for air quality forecasting.

• Published in Q1 journals (Springer, IEEE Access, Elsevier). Publications & Presentations (Selected)

• 20+ peer-reviewed ISI Q1/Q2 publications in applied statistics and machine learning (Full publication list available on Google Scholar)

• Presenter at AASLD, ASHG, UMCGR GI Retreats, and Internal Medicine Research Day.

• Referee for leading journals: IJFS, Knowledge-Based Systems, Environmental Science & Pollution Research.

Honors & Awards

• Putra Research Grant (PI), Universiti Putra Malaysia, 2022

• Excellent Thesis Award, Universiti Kebangsaan Malaysia, 2021

• President’s Award for Distinguished Students, Yemen, 2008

• Graduate Research Assistant Fellowship, UKM, 2017 Memberships

• American Association for the Study of Liver Diseases (AASLD)

• American Society of Human Genetics (ASHG)

• American Statistical Association (ASA)

References contact: Upon request



Contact this candidate