Haocheng Zhao
adhium@r.postjobfree.com 919-***-**** Chapel Hill, NC
LinkedIn: https://www.linkedin.com/in/haocheng-zhao-162118191/
GitHub: https://github.com/Houdinic
Education
University of North Carolina, Chapel Hill Aug. 2019 - Present
M.S. in Information Science; GPA: 4.00/4.00
University of California, Berkeley Jan. 2018 - May 2018
Bachelor’s Exchange Student; GPA: 3.88/4.00
Related Courses: Introduction to AI (A), Structure and Interpretation of Computer Programs (A), Data Mining
Tianjin University Sep. 2015 - Jun. 2019
Bachelor of Management in Information Management and Information Systems
Skills
Programming languages: Python, C++, C, Java, R, SQL, HTML, Shell script, CSS, Markdown.
Experience in Machine Learning, NLP, Data Structures, Computer Organization, Algorithms, Internet Protocols.
Tools and Frameworks: Git, Jekyll, Ansible, Agile, scikit-learn, Keras, NumPy
Professional Experience
Project Gutenberg (PG). Chapel Hill, NC, United States
Full Stack Engineer Feb. 2020 - Present
Participate in daily maintenance of the Project Gutenberg production site and in building the new Project Gutenberg site in an Agile environment. The production site has over 6 million downloads per month.
Write .htaccess files on a CentOS 7 server to manage links on the PG development site. Assist in deploying a new CentOS 8 server, using the automation tool Ansible for PHP and Apache installation and firewall setup.
Use SQLAlchemy and PostgreSQL to implement ORM in Libgutenberg, greatly simplifying maintenance. Investigate and fix a long-standing anomaly in search results on the PG development site, caused by the Genshi template used by the PG search engine Autocat3.
Test and improve the CSS and HTML5 design of the Gutenberg site based on a survey of over 200 users. Improve the taxonomy structure and redesign the Bookshelf cataloging function using Django on the new PG site.
CAHL Lab at University of California, Berkeley Berkeley, CA, United States
Lab researcher May 2018 - Jan. 2019
Built a model generator in Python that produces three models (BOW, TF-IDF, and Doc2Vec) for different text inputs.
Proposed a new metric for the models and comprehensively evaluated it against two existing ones. Generated 15 new articulation pair proposals for each unarticulated course (188 in total).
Processed the initial articulation dataset from assist.org with self-developed pipelines built on Beautiful Soup, and identified all edge cases.
TianAn Smart Beijing, China
Intern engineer Jul. 2017 - Sep. 2017
Developed a front-end code template for database system web pages in HTML, JavaScript, and Vue.
Improved the layout of more than 50 database website pages for PetroChina and worked with internal and external teams to complete the project 2 months ahead of schedule.
Projects
Weather Forecasting Contest
Filled all missing values in the past year's data (180k+ in total) using a novel approach and performed feature selection in Python with the scikit-learn API.
Achieved a significantly low MSE and won 3rd place among over 30 teams.
Handwritten Number Recognition
Flattened handwritten digit images into 784-dimensional vectors and encoded the correct class as a one-hot vector. Built a multi-layer neural network trained with a softmax loss, reaching 97.3% accuracy.
Publications
Zachary Pardos, Hung Chau, Haocheng Zhao, Data-Assistive Course-to-Course Articulation Using Machine Translation, Proceedings of the 6th ACM Conference on Learning @ Scale (L@S) June 24-25, 2019, Chicago, Illinois, USA [Full Research Paper] [Slides] *Best Paper Award*