Yifu Xia
www.yifuxia.com • ac0b5w@r.postjobfree.com • 213-***-****
PROFILE
Driven and curious learner with strong self-learning ability. EDUCATION
Shanghai Jiao Tong University Sep 2011 - June 2015 B.S. in Computer Science (Information Security Engineering) Object Oriented Programming, Data Mining, Principles of Database, Probability and Statistics. University of Southern California Sep 2015 - May 2017 M.S. in Data Informatics
Analysis of Algorithms, Database Systems, Machine Learning, Information Visualization. WORK EXPERIENCE
Fullscreen, Inc Jan 2017 - Mar 2017
Data Science Intern
- Helped Youtube content creators to gain revenue by utilizing data science.
- Accomplished data acquisition and feature engineering single-handedly.
- Utilized random forest to rank important features; discovered optimal video parameters to boost views. Pastoral (Aiphy Technology) Nov 2015 - Mar 2016
Data Engineer
- Came up with innovative ways to implement data-driven agriculture.
- Populated sensor data into NoSQL database (MongoDB).
- Built a web application to display sensor data in real time and to control the smart device. PROJECT EXPERIENCE
Recommendation System for Blood Transfusion Volume Mar 2017
- Cooperated with Keck School of Medicine of USC to help eliminate blood transfusion waste during surgeries by utilizing the data they provided.
- Data cleaning and preprocessing using Python (Pandas).
- Built predictive regression models (Linear regression, KNN, Random forest) for red blood cells prediction during a surgery.
Customer App Behavior Data Analysis Mar 2017
- Built models (Random forest, Logistic regression, SVM) to predict potential customer with high value.
- Provided business insights based on the feature ranking of the predictive model. Yelp Restaurant Reviews Analysis Dec 2016
- Gained business insights by utilizing unsupervised machine learning techniques (Kmeans, SVD, NMF).
- Handled unbalanced classification problem.
Basketball Stats Data Visualization Nov 2016
- Scraped basketball data from web.
- Built interactive 3D visualization by utilizing WebGL technology. RESEARCH EXPERIENCE
Big Data Computing Platform Analysis Lab, SJTU, Shanghai Oct 2014 - May 2015
- Utilized machine learning methods (multi-class classification) to generate synthetic graphs for benchmarking various distributed computing platforms.
- Deployed distributed computing platforms such as Hadoop and GraphLab.