Nguyen Van Hao
DATA ENGINEER/DATA ANALYST
PROFILE
Male
*****************@*****.***
https://www.linkedin.com/in/nguyễn-văn-hào
795217189/
Ho Chi Minh City, Viet Nam
SKILLS
Language
English - TOEIC L&R 865
Developer
SQL
Python
Power BI
Apache Spark-Pyspark
Apache Hadoop
Apache Kafka
Linux (Ubuntu)
HONORS & AWARDS
2024-2025, 2023-2024, 2022-2023
Semester 1 Academic Encouragement
Scholarship
OBJECTIVE
As a passionate IT person, I have always sought a suitable environment to self-growth with new technologies. I am currently looking for a Data Engineer/Data Analyst position where I can gain professional knowledge and dedicate my skills, abilities, and experience to contribute to the company's development.
EDUCATION
HCMC University of Technology and Education 2021 - 2025 Major: Information System
GPA: 3.83/4.0
PROJECTS
Simulate real-time Twitter
sentiment analysis
07/2024 - 08/2024
Team size: 1
My responsibilities:
• Collected live tweets using Kafka from the Twitter DataSet on Kaggle
• Utilized Spark Streaming to process and analyze the data in real-time
• Classified tweets by sentiment using NLP techniques.
• Stored the sentiment analysis results in MongoDB for persistence Git: https://github.com/Hao12B2/twitter_kafka_spark_streaming_etl.git Build a data warehouse on
graduation rates of US Universities
04/2024 - 05/2024
Team size: 4
My responsibilities:
• Understood business and designed a snowake schema
• Designed and built a data warehouse using SQL Server
• Implemented SSIS, SSAS on Microsoft Visual Studio and used Power BI tool to visualize and answer some questions related to business Git: https://github.com/Hao12B2/Data_Warehouse_Project.git Loan prediction based on customer
behavior
04/2024 - 05/2024
Team size: 4
My responsibilities:
• Handled null value, duplicated value and misformatted name
• Used machine learning (Logistic Regression, KNN, Random Forest) and deep learning (ANN) model to predict loan defaults.
• Implemented clustering using Kmeans
Git: https://github.com/Hao12B2/Loan_Prediction.git