Lei Cai
Email: *********@*****.*** Cell Phone: +1-412-***-**** Address: 4910 Centre Ave., Pittsburgh, PA 15213
LinkedIn: http://www.linkedin.com/pub/lei-leila-cai-tsai/5b/4ab/166
Education
M.S., Information Science, University of Pittsburgh, Pittsburgh PA 08/2012- 12/2013 GPA: 3.89/4.0
B.S., Computer Science, Beijing Language & Culture University, China 09/2008-07/2012 GPA: 3.59/4.0
Exchange Student, Computer Science, Murray State University, Murray KY 08/2009-05/2010 GPA: 3.90/4.0
Professional Skills
Languages: C/C++, C#, Java, SQL, Visual Basic, MS Office
Database: SQL Server, MySQL, MongoDB
Packages: Dreamweaver, NetBeans, Microsoft Office, Eclipse, Visual Studio, ArcGIS Desktop, MATLAB, RStudio
Career Objective
Seeking an entry-level opportunity in Information Technology to develop my ideas, knowledge, and skills in database
management, data analysis, and software testing.
Internship
Testing Engineer, Huaweisoft 05/2013-07/2013
Project: Guangzhou Construction Projects Contract Online Filing System (2nd Phase)
Contract parties fill out contract forms online and submit them to audit staff, who review and confirm the contracts. Audit staff
may return a review request to the parties if the contract does not qualify.
I designed and executed test cases using test methodologies, adjusted the cases and methods to cover all functions and flaws
broadly, and worked with the project manager and developers to identify possible causes of the flaws.
Academic Experience/Projects
vShare: Platform for Volunteer Experience Tracking and Communication 02/2014-present
A platform that lets users keep track of their volunteer experience and share it with others.
Uses social media data to determine which aspects of volunteering a user is mainly interested in.
My role is to design the Android application for the platform.
Functions implemented so far: user login and sign-up, new posts tabs.
Language used: Java
Relationship between LinkedIn use and job hunting success rate 09/2013- 12/2013
Course project for Social Computing
Studied the relationship between LinkedIn profile features and users’ job-hunting outcomes or skill endorsements.
Conducted a survey to collect respondents’ demographic information and LinkedIn usage behavior.
Collected social data and applied linear regression and other data mining algorithms for prediction.
One finding so far is that the number of groups a user has joined has a positive influence on their skill endorsement
count. Our study revealed additional relationships as well.
Methods used: linear regression, snowball sampling.
Predict which Xbox game a visitor will be most interested in based on their search queries 09/2013-12/2013
Course project for Data Mining
A closed Kaggle.com competition to predict the popularity of Xbox games based on users’ search queries and behavior.
Identified the five most popular Xbox games after analyzing the data.
Used data mining and machine learning algorithms to analyze the data, which included 42365 samples.
Main tasks were feature selection, text mining, and comparing different algorithms based on their attributes.
Methods used: data mining and machine learning algorithms (logistic regression, SVM, Naïve Bayes).
Language used: R, Python.
Predict survival on the Titanic 01/2013- 05/2013
Course project for Data Analysis.
Used structured data, including passenger information, ticket fares, and socio-economic status, to predict which kinds of
passengers were more likely to survive the Titanic disaster.
We tried different methods such as SVM, Naïve Bayes, and Random Tree to build the prediction model. Discretized the ages
and replaced missing values with the average age. Used leave-one-out cross-validation to test our models.
Hypothesized which features were crucial to passenger survival, then verified them.
Achieved 81% accuracy on the Kaggle evaluation and ranked 23 out of 2547 teams at that time.
Software used: GeNIe, Excel, Weka
Geographical Navigation System Based on Air Pollution Exposure 01/2013- 05/2013
Computed the air pollution exposure weight of each road segment and chose optimal route options for users, based on the EPA
database and the Pennsylvania road network.
Used a distributed system, sending the dataset and computation modules to an open-source cloud, to shorten the computation
time for road segments.
Broadened the road map from Pennsylvania to the United States.
Software used: ArcGIS, MySQL
Asian Cuisine & Recipe Recommendation System 01/2013- 05/2013
Course project for Adaptive System Design
A recommendation system for Asian cuisine and recipes.
Built users’ personal profiles explicitly and combined each user’s profile with search queries to recommend cuisine and recipes.
Used a hybrid model combining collaborative filtering and content-based models to generate recommendations.
Software used: MySQL, MyEclipse, Navicat
Languages used: Java, JavaScript
Web Link: http://washington.sis.pitt.edu/asian_cuisine/
Front-End Design of EMR database system 10/2012- 05/2013
Course project for Database Management and Advanced Database Management.
Keeps track of patients’ medical records as they are examined at different medical facilities.
Designed the system’s front-end interfaces, including login, record input, and record search.
Future development will extend the system’s functionality with additional interfaces.
Software used: MySQL
Languages used: HTML, CSS, PHP
Professional Courses
Web Technologies and Standards, (Advanced) Database Management, Adaptive System Design, Data Analysis, Independent Study
on Geographic Information Systems (GIS), Data Mining, Social Computing, Interactive System Design
Honors and Awards
iFest: Books and Bots, 1st place prize, School of Information Science, University of Pittsburgh 04/2013
Kaggle Competition: Titanic: Machine Learning from Disaster. Ranking 23/2647 05/2013
(http://www.kaggle.com/c/titanic-gettingStarted/leaderboard TeamID lcai)