Jiarui Qi
*** * ********** ****, *** Jose, CA *5128 **********@*****.*** 703-***-****
RELEVANT EXPERIENCE
American Institutes for Research, Quality Control Assistant (Data Analyst), Washington, DC 02/2018- 04/2020
· Utilized proc SQL in SAS to automate comprehensive querying of score reports and streamline our QC process.
· Developed Python scripts (numpy, pandas, matplotlib) to automate the download of all online student records from our final Online Reporting System, then do the data cleaning and combine into a master file to help improve our quality control procedures.
· Tested scores across the United States with over 100, 000 observations by utilizing VLOOKUP and pivot table for accuracy and soundness to encompass all stages of the report analysis cycle.
· Conducted rigorous quality assurance checks on data integration for paper and online student score reports across 26 clients.
CABEL Foundation Inc, Research Assistant Intern, Washington, DC 09/2017-02/2018
· Responsible for data collection and data maintenance for grant applications.
· Conducted research into financial literacy education and generated Tableau dashboards for indicators visualization.
George Washington University CCAS, Student Consultant, Washington, DC 09/2016-12/2016
· Discussed with clients about their aims and understood the principle of lip-reading training experiment.
· Applied Generalized Linear Mixed Model to longitudinal data to find an effective lip-reading training method.
· Utilized R to detect variables which are significant to the result and evaluated the significance of 2-way and 3-way interaction.
EDUCATION
George Washington university, Washington, DC 09/2015-05/2017 Master of Science in Statistics; GPA: 3.78/4.0;
Relevant Coursework: Data Analysis, Survival Analysis, Time series Analysis, Mathematical Statistics, Applied Linear Models, Regression Graphics, Statistical Computing, Statistical Consulting
Xi’an Jiao tong- Liverpool university, Suzhou, China 09/2011-06/2015 Bachelor of Science in Applied mathematics; GPA: 70/80
TECHNICAL SKILLS
· SQL, R, SAS (Certified Advanced Programmer), Python (Numpy, Pandas, Matplotlib, Plotly), Excel (vlookup, pivot Table), Tableau
· Advanced statistical methods (ANOVA, a/b hypothesis testing, cluster analysis, time series, linear regression)
PROJECTS
Google Analytics Customer Revenue Prediction
· Leveraged 1.4M entries of Google Merchandise Store customer data to predict revenue per customer.
· Summarized main characteristics of the dataset and managed missing values by performing exploratory data analysis (EDA) and data visualization using Plotly and Matplotlib in Python.
· Built time-series model with R zoo and forecast packages for revenue forecasting and detected time features that can cause overfitting.
·Conducted Gradient Boosting Machine (GBM) model and improved GBM model performance utilizing feature engineering and hyperparameter tuning
Stock Investment Assistant
·Built web crawlers with Python and Scrapy to obtain fundamental information of financial market and specific companies through resources like Yahoo Finance and Wall Street Journal.
·Utilized Quandl Python module to acquire stock price datasets as the format of Pandas Dataframe.
·Developed self-designed model to filter high-quality stocks and deployed the integrated workflow on AWS.