Yuanzhe Cai
Address: College St. ***, Apt B, Arlington, TX 76010
Email: acedms@r.postjobfree.com
Telephone: 817-***-****
Objective
To obtain a position as a data scientist
Skills
Focus on ranking problems in social networks and recommendation systems;
Analysis of user behavior;
Experience in collecting and analyzing large data sets;
Solid knowledge of Data Mining, Information Retrieval, and Database Systems;
Hands-on experience with database kernel components for PostgreSQL and Kingbase;
Strong programming skills in Java, C, C++, Matlab, SQL, PL/SQL;
Familiar with Lucene and Weka
Education
Ph.D.: Computer Science and Technique, 05/2014, University of Texas at Arlington, GPA: 3.78/4.0
Master: Computer Application Technique, 07/2008, Renmin University of China, GPA: 3.1/4.0
Bachelor: Software Engineering, 07/2005, Xidian University, GPA: 3.6/4.0
Accomplishments
Online Community Analysis (e.g., Facebook, Yahoo! Answers, Stack Overflow):
Crawl more than 30 GB of raw web data
Manage more than 10 million question-answer pairs
Design efficient and effective algorithms to analyze answerers’ behavior
Calculate users’ expertise
Recommend questions to the appropriate users
Database Kernel Development (PostgreSQL 8.3):
Develop a result set cache
Develop performance monitoring tools
Take part in the China PostgreSQL open source community
Selected Professional Experience
09/2009-05/2014 Identifying Expertise and Answer Quality in Q/A Social Networks (Java)
We developed algorithms to identify generalists and specialists in Q/A communities.
Based on users’ expertise for a given question, we automatically routed new questions to appropriate users.
10/2005-03/2006 SQL result set cache (C, Linux)
To improve SQL query performance, we developed a system that caches SQL query results.
We modified the PostgreSQL source code and created an in-memory cache to store the results of a query.
We implemented both a client-side memory cache and a shared-memory cache.
The implementation was tested with the TPC-C benchmark.
This result set cache was used in Kingbase v4.1.
07/2006-01/2007 Database Performance Monitoring (C, Linux)
We developed a set of database views to monitor database performance, covering I/O, buffer, file, lock, event, and log information.
This monitor was used in Kingbase v4.1.
03/2006-07/2006 Cadre evaluation system of the CPC Central Committee (VB, PowerDesigner, Kingbase 4.1)
We developed the cadre evaluation system for the CPC Central Committee.
This system was used for cadre elections in 23 provinces in China.
02/2007-12/2007 Code system development (ontology management system) (Java, PL/SQL, Oracle)
We implemented the different kinds of relationships, instances, and classes for ontologies.
This system was used in the Database & Intelligent Information Retrieval Lab.
Selected Publications
Yuanzhe Cai and Sharma Chakravarthy. Answer Quality Prediction in Q/A Social Network by Leveraging Temporal Features. IJNGC’13
Yuanzhe Cai and Sharma Chakravarthy. Expertise Ranking of Users in QA Community Features. DASFAA’13
Yuanzhe Cai and Sharma Chakravarthy. Pairwise Similarity Calculation of Information Networks. DaWaK’11
Yuanzhe Cai, Miao Zhang, Dijun Luo, Chris H. Q. Ding, Sharma Chakravarthy. Low-order tensor decompositions for social tagging recommendation. WSDM’11
Yuanzhe Cai, Miao Zhang, Chris H. Q. Ding, Sharma Chakravarthy. Closed Form Solution of Similarity Algorithms (Poster). SIGIR’10
Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du, Xu Jia. An Adaptive Method for the Efficient Similarity Calculation. DASFAA’09
Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du. Efficient Algorithm for Computing Link-Based Similarity in Real World Networks. ICDM’09
Pei Li, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du. Exploiting the Block Structure of Link Graph for Efficient Similarity Computation. PAKDD’09
Yuanzhe Cai and Sharma Chakravarthy. Non-negative Matrix Decomposition vs. HITS. PKDD’14 (submitted)
Yuanzhe Cai and Sharma Chakravarthy. Identifying Specialists for Concepts. ICDM’14 (submitted)
Internship
09/2005 - 01/2007 Beijing BaseSoft Information Technologies Co., Ltd.
07/2005 - 09/2005 Shanghai Xinyou Information Technologies Co., Ltd.
02/2005 - 06/2005 Xi’an Software Park
Honors & Scholarships
TA Fellowship at the University of Texas at Arlington, fall 2009 to present
Third Scholarship at Xidian University, 2005
Three-Year Fellowship from Renmin University of China, 2005-2008
IBM WebSphere Certification
References
Dr. Sharma Chakravarthy, Professor, Department of Computer Science and Engineering, UT Arlington
Dr. Chris Ding, Professor, Department of Computer Science and Engineering, UT Arlington