Software Engineer Information Systems

Location:
San Jose, CA
Posted:
November 23, 2023

Resume:

Shuyao He

Data pipeline, data mining, data platform, distributed systems, big data, JVM, unit testing, Amazon S3, Kotlin, Go, HTTP, TikTok, Google, Amazon, Stanford, Carnegie Mellon, Berkeley, computer science

San Jose, CA ad1emb@r.postjobfree.com Tel: 408-***-**** http://www.linkedin.com/in/shuyao0110 github.com/Shuyao0110

EDUCATION

Northeastern University, San Jose, CA

Master of Science in Information Systems, Expected May 2025

City University of Hong Kong, Hong Kong, China

Master of Social Science in Applied Psychology, Jun 2023

Tongji University, Shanghai, China

Bachelor of Engineering in Traffic Engineering, GPA 4.12/5.00 (Top 30%), Jul 2022

National University of Singapore, Singapore

Exchange Program, Civil Engineering, Dec 2021

Relevant Courses: Data Science Engineering Methods and Tools, Web Design and User Experience Engineering, Application Engineering and Development, Data Management and Database Design, Software Quality Control and Management, Software Methodology

WORK EXPERIENCE

Zhejiang Sanhua Intelligent Control Co., Ltd, Hangzhou, China

Software Engineer Intern, May 2023 - Sep 2023

● Participated in master data governance for the company’s Enterprise Resource Planning (ERP) system by designing and building data warehouses

● Designed and built ETL (extract, transform, load) pipelines that transferred over 20,000 records between the Master Data System and the cloud Office Automation (OA) System on the Enterprise Service Bus (ESB) platform using RESTful APIs

● Improved the performance of the related process by 50% by optimizing the pipeline structure and reducing the request frequency

● Documented the design process and the interfaces

Xuesong Wang Research Group, Tongji University, Shanghai, China

Research Assistant, May 2022 - Sep 2022

● Participated as an in-vehicle assistant in a road rage incident study: the tests were carried out in Tongji University's 8-degree-of-freedom high-fidelity driving simulator, and the experimental scenario was run with 30 drivers

● Implemented Python scripts to preprocess the raw images using OpenCV (cv2) and built the training set with labelImg

● Trained a CNN model in PyTorch to predict drivers' emotions from their facial expressions with 86% accuracy

● Evaluated the quality of the data and video from the driving simulator using Premiere

ACADEMIC PROJECTS

Animation Website Crawler and Data Visualization, Feb 2023 - May 2023

● Built a Hadoop cluster with one NameNode and five DataNodes on AWS EC2, and deployed MySQL, Hive, Docker and Sqoop on it

● Collected over 200,000 records (scores, subscribers, purchases, and recommendations) using Selenium and BeautifulSoup

● Used MySQL as a temporary database and loaded the data into HDFS using Sqoop

● Queried and computed results such as the top 10 recommendations, top 10 authors by volume, and the weekly top list using HiveQL, and stored them in HBase

● Built RESTful APIs on the backend with the Flask framework and visualized the processed data with ECharts on the frontend

E-commerce Platform Design and Implementation, Sep 2022 - Dec 2022

● Developed a full-stack e-commerce application based on Spring Boot, MySQL and Vue, with responsive design and dynamic search

● Integrated secure user authentication using Okta, JWT, OAuth2, and OpenID Connect, along with SSL/TLS for secure communications

● Designed and implemented a Spring Boot backend using RESTful APIs for seamless communication and a MySQL database for data management

● Improved the user experience significantly by categorizing the pictures and paginating the content

● Designed and implemented the network request library on top of Axios, and improved component efficiency by using Pinia to manage complex state

Shared Bike Operational Research Studies, Sep 2020 - Jan 2021

● Built a web crawler using Selenium and geckodriver to collect the target dataset shared by oBike

● Designed and implemented the crawling algorithm, using random proxies and randomized timing, to effectively lower the rate of getting blocked

● Managed the raw data with CRUD operations in MySQL

● Used scikit-learn to perform Lasso regression of shared bike usage with respect to urban construction factors (bike lane length and Shannon entropy of building), cycling distance, and bike fleet size; achieved a precision of 73.8%

● Calculated the marginal effect and elasticity of shared bike usage with respect to bike fleet size, and put forward suggestions for the company based on the model's predicted results

SKILLS

Programming Languages: Python, Java, JavaScript, MATLAB, C# .NET, HTML, CSS

Tools: Spring Boot, AWS, REST APIs, Docker, Linux, PyTorch, Hive, Hadoop, Flask, Spark, MongoDB, SQL, NoSQL, Web Scraping, React, Node.js, Troubleshooting, Git, Unit Test


