CAP HUU ANH TRI
*********@*****.*** +849******** Thu Duc city, Ho Chi Minh city
https://www.linkedin.com/in/capp2003/
https://github.com/CapHaTri
Education
University of Information Technology (UIT) - Computer Science
Linh Trung Ward, Thu Duc City, Ho Chi Minh City
September 2021 - Present
GPA: 3.24/4.00
TOEIC: 545
Intern Data Engineer
Summary
I am a third-year Computer Science student seeking an internship as a Data Engineer. My goal is to gain practical experience and apply a solid foundation in computer science principles to real-world data engineering problems. I am willing to learn, and I look forward to collaborating with experienced professionals and making meaningful contributions to the team.
Projects
Driving Declinometer Estimation with Kafka Streaming - Personal Project
Implemented a real-time driving declinometer estimation system.
Data Analysis: Analyzed data using Pandas and Seaborn.
Model Training: Trained the model and scalers and saved them as .joblib files.
Kafka Streaming Usage: Streamed data from vehicle sensors and ran predictions with the trained model.
Real-time Prediction: Successfully transmitted predicted values through Kafka.
Recommended Topic Hashtags - Group Project
Created an automated labeling system for Facebook posts about ML and DS.
Data Collection: Used Selenium to crawl Facebook post data.
Data Preprocessing and Labeling: Preprocessed the data and assigned labels to each post based on predefined hashtags.
Model Training: Trained machine learning models using TF-IDF and Bag of Words features, then evaluated them and selected the best-performing model.
Demo: Built a user-friendly Streamlit interface that automatically labels Facebook posts.
Booking Medicine Web Application Project - Personal Project
Created an online prescription system.
Web App Design: Designed admin and user interfaces for online prescription booking.
UI Development: Built the interfaces using ReactJS for a modern user experience.
Database Setup: Set up a MySQL database to store prescription data efficiently.
Backend Implementation: Developed backend functionality using the Express framework and RESTful APIs.
Successfully built an online prescription booking system.
Image Retrieval - Personal Project
Created a model that suggests images of animals (from a provided list of animals).
Data Source: Used a dataset available on Kaggle.
Feature Extraction: Extracted image features using VGG-16 and stored them in a database.
Similarity Calculation: Calculated the cosine similarity between the query image's features and the features stored in the database, then selected the images with the smallest cosine distances.
Demo: Developed a user-friendly Streamlit demo interface that suggests images of animals based on an input image.
Skills
Programming Languages: Python (pandas, matplotlib, numpy, seaborn, BeautifulSoup4, selenium), JavaScript, C++, SQL
Tools and Frameworks: Scikit-learn, TensorFlow, Streamlit, Canva, Postman, LaTeX, Docker, ReactJS, NodeJS, Git
Big Data: Apache Hadoop, PySpark, Kafka
Database: MySQL
Microsoft Office: Excel
Languages: Vietnamese, English