Hongxu Wang
adb6yt@r.postjobfree.com 515-***-**** www.linkedin.com/in/hongxu-wang
EDUCATION
Iowa State University, Ames, IA. Master of Science, Statistics. GPA 3.74/4.00 08/2017 – present
Dalian Polytechnic University, Dalian, China. Master of Science, Food Science. GPA 3.94/4.00 08/2012 - 06/2015
Dalian Polytechnic University, Dalian, China. Bachelor of Science, Food Science. GPA 3.96/4.00 08/2008 - 06/2012
SKILLS
Computer Skills
• Programming: R, Python, SQL
• Platform and Tools: RStudio, MySQL Workbench, Jupyter
• Others: Microsoft Office (Word, Excel, PowerPoint), Tableau, LaTeX, GitHub, JMP, SAS
Statistical Skills
• Modeling: machine learning (linear/logistic regression, classification, clustering), A/B testing, categorical data analysis, time series analysis, sampling, ANOVA, dimension reduction, model selection
PROJECTS
Exploration of Car Crashes (R dplyr, ggplot2, shiny)
• Prepared Car Crashes datasets using high-performing data wrangling and visualization code in R.
• Performed spatial and time series analysis of where and when accidents happen by creating an interactive map and plotting the smoothed periodogram.
• Conducted casualty analysis by building a multiple logistic regression model to predict the likelihood of injury with 88% accuracy.
• Visualized the data analysis results by building an information dashboard using R Shiny.
Allotetraploid Genotyping and SNP Detection (R, C/C++)
• Extracted, cleaned and analyzed genomic and phenotypic data from biological sciences
• Applied likelihood ratio tests to detect variations in allotetraploid sequence data
• Built predictive models using the EM algorithm to genotype allotetraploids.
• Accomplished allotetraploid genotyping and SNP detection.
Exploration of Women’s E-commerce Clothing Reviews (Python NumPy, Pandas, Scikit-Learn, Matplotlib, Seaborn)
• Used Python NLTK for text preprocessing and sentiment analysis on over 20,000 text reviews to understand the correlation of different variables in customer reviews on a women's clothing e-commerce site
• Implemented TF-IDF and built classification models to predict whether a customer will recommend the reviewed product, using a Naïve Bayes classifier and logistic regression, which obtained 83% and 86% accuracy, respectively
EXPERIENCE
Department of Statistics, Iowa State University, Ames, Iowa Teaching Assistant (R and JMP) 08/2017 – present
• Responsibilities: Helped design the course, construct tests, prepare materials, and grade assignments; assisted in the lab during curriculum-based group lessons using R or JMP for data visualization and data analysis; worked with students one-on-one after class to learn about problems they were having with the course.
National Engineering Research Center of Seafood, Dalian, China Research Assistant (PCA, SPSS) 08/2012 - 06/2015
• Statistical Analysis in Food Science: Investigated the effects of heating conditions on the fatty acids and volatile compounds in abalone using PCA (Principal Component Analysis); demonstrated that heating at 80 °C for 0.5-2 h generated higher contents of volatile compounds, mainly hexanal, heptanal, octanal, nonanal, and undecanal.
SELECTED AWARD
National Scholarship of China, Ministry of Education of the People’s Republic of China 2014. This award is for outstanding graduate students in China.