Ruowen Wang
949-***-**** ad2ghs@r.postjobfree.com https://www.linkedin.com/in/ruowenwang1997/ Location: Santa Ana EDUCATION & CERTIFICATIONS
University of Southern California. CA, United States Jan.2020 - May.2022 Master of Science in Biostatistics Cum GPA: 3.72/4.0 State University of New York at Binghamton. NY, United States Aug.2015 - May.2018 Bachelor of Science in Financial Economics Cum GPA: 3.03/4.0 SAS Base Certificate Aug.2023
SKILLS
Data Analytics: SAS, MS Excel, A/B Testing, Casual Inference, Tableau, Power BI, STATA
Machine Learning: Python (Scikit-Learn), R, SPSS Predictive Modeling, Hypotheses testing, Regression Analysis
Database Processing: SQL, Spark, UNIX
PROFESSIONAL EXPERIENCES
XpertDox, LLC., Birmingham, AL, USA Feb.2023 - Present SAS Programmer: Develop autonomous medical coding process (SAS)
Maintained and developed the automation process of transforming EMRs to JSON documents by using SAS
Generated daily reports to the clients, such as American Family Care, Arkansas Urology, etc.
Completed Ad-hoc requests, such as filtering medical claims and debugging daily codes Jiangsu Hengrui Pharmaceuticals Co., Ltd., Shanghai, China Jul.2022 - Jul.2023 Statistical Programmer: Support to generate clinical datasets to deliver the TLFs for CSR (SAS)
Prepared and maintained the main side SDTM and ADaM datasets for the following projects for New Drug Application: SHR- 1314-301, SHR-1314-ISS, SHR-1316-III-301, SHR-A1921-I-101
Finished QC of TLFs for project SHR-1314-301 during the first and the second dry run, improved the accuracy of the outputs
Completed induction training and conducted presentations of the CDISC standards, FDA Guidance, ICH Guidelines (such as E6, E8, E9, etc.) and Drug Development in Oncology (RECIST 1.1 Guidelines) A&C Future, Inc., Newport Beach, CA, USA Aug.2022 - Feb.2023 Data Analyst (Market Research): Segmentation analysis and business administration of a startup company (Microsoft Office Suite)
Conducted marketing research for American RV industry by product type, brand, competitors’ pricing strategy, marketing campaign, etc., helped the mechanical engineering team improve decisions Harbdata Technology Co., Ltd., Shanghai, China Oct.2021 - Jun.2022 Data Analyst Intern: Analyze e-commerce data for Huggies and Kleenex (Python Spark)
Worked with analysts to retrieve transaction-level data from JD.com data warehouse and Tmall (Alibaba) databank, loaded the data to normalize for periodic reporting with Python Spark
Conducted market segmentation analyses by purchasing time, SKU, product type, brand, competitors’ pricing strategy, number of views, marketing campaign, etc.
Maintained an Excel Weekly Sales Report for Huggies diapers and an Excel Monthly Sales Report for Kleenex moist toilet paper including periodic comparisons, trends of preference, and sales performance Data Analysis for Alibaba Group Aug.2021 - Sep.2021 Data Analyst Intern: Enhance the functionality of risk management based on users’ portraits (SQL)
Engineered 161k customer data from database and extracted and engineered 15+ features including WOE, IV index, records etc.
Estimated the customer lifetime value by using RFM model based on users’ purchasing history on AliExpress
Predicted the probability of user fraud behavior with existing customers across 6 marketing channels in gradient boosting regression and linear regression model, achieved 97.3% accuracy, and lifted cross-sell rate by 20%
Optimized the marketing campaign by feeding customized content to 5 customer clusters, achieved 70% boost for performance GF Securities Co., Ltd., Beijing, China Mar.2019 - Oct.2019 Assistant Analyst Intern: Focus on the technology, media, and telecom (TMT) Industry (Tableau)
Developed dashboard report automation system to create 3 interactive Tableau market reports weekly with live streaming performance database, saved 30% of the reporting time
Participated in and conducted meeting summaries based on corporates' quarterly and annual reports, focused on helping analysts to build new tracking databases and stock price forecasting models
Collaborated with cross-functional teams to understand requirements and merged the data from 15 branches KEY COURSE PROJECTS
Research on identifying the environment-disease relationships by using datasets of stimulated cells Feb.2021 - May.2022 Verify the relationships of environment and risk of diseases through differentially expressed genes (R/UNIX)
Developed 3 datasets of differentially expressed genes after stimulation by more than 50 environments with R and UNIX
Created the gene-set SNP-annotations for stratified LD score regression by cS2G and 100kb approaches
Tested the selected approaches for >50 environments across 100 human diseases and complex traits of GWAS summary statistics, to select the best strategy to create SNP-annotations by constructing a gold-standard list of environment-disease pairs
Applied the final strategy to detect new candidate environment-disease relationships and validate the original relationships through 19,966 total tests