DAI NGUYEN BA - NLP ENGINEER
+849******** **.*****@*****.*** www.linkedin.com/in/itdainb
Thu Duc - Ho Chi Minh City
I am a passionate and motivated student specializing in Natural Language Processing. With a strong aptitude for problem-solving and a focus on cutting- edge technology, I am committed to staying up- to- date with the latest developments in the field. My skills in data analysis, programming, and machine learning make me a valuable asset to any team. Education Level
VNUHCM - University of Information Technology (Trường Đại học Công Nghệ Thông Tin - Đại học Quốc gia Thành phố Hồ Chí Minh) - Quarter 6, Linh Trung Ward, Thu Duc District, Ho Chi Minh City
Sep 2021 - Sep 2025 (Expected)
Bachelor Degree in High-Quality Information Technology Programs, 3.28/3.99 Research
Awards
First and second rank in some languages in the shared task Sentiment Analysis for African Languages – The 17th International Workshop on Semantic Evaluation SemEval-2023.
Third rank in the shared task of Financial Targeted Sentiment Analysis in Spanish – The fifth Evaluation Campaign of Natural Language Processing Systems in Spanish and other Iberian languages IberLEF2023. Third rank in the shared task of Categorical Emotion Detection in Italian Social Media – The 8th Evaluation Campaign of Natural Language Processing (NLP) and Speech Tools for Italian EVALITA 2023. Top 5 in Bosch Coderace Challenge 2023 - Exceed the Limitless Mind. Top 10 in Quy Nhon AI Hackathon 2022 – The largest AI competition for engineers organized by Quy Nhon AI Center, FPT Software in collaboration with the Ministry of Science and Technology and VNExpress. Publications - Accepted paper
[1] Dang Van Thin, Dai Nguyen Ba, Duong Ngoc Hao, and Ngan Luu- Thuy Nguyen. ABCD Team at FinancES 2023: An Unified Generative Framework for the Financial Targeted Sentiment Analysis in Spanish. The fifth Evaluation Campaign of Natural Language Processing Systems in Spanish and other Iberian languages (IberLEF), 2023.
[2] Dai Nguyen Ba, Uyen Nguyen Ngoc Phuong, Dang Van Thin. Ensemble Approach for Categorial Emotion Detection in Social Media Messages: EMit at EVALITA 2023. The 8th Evaluation Campaign of Natural Language Processing (NLP) and Speech Tools for Italian (EVALITA), 2023.
[3] Dang Van Thin, Dai Ba Nguyen, Dang Ba Qui, Duong Ngoc Hao and Ngan Luu-Thuy Nguyen. ABCD Team at SemEval-2023 Task 12: An Ensemble Transformer- based System for African Sentiment Analysis. The 17th International Workshop on Semantic Evaluation SemEval, 2023.
[4] Dai Nguyen Ba, Phan Ca Phat. An Approach for Vietnamese - Chinese Machine Translation Based on Language Models, UIT - Research topics for undergraduate and graduate students, 2023.
[5] Nguyen Viet Anh, Dai Nguyen Ba, Dang Van Thin, and Ngan Luu- Thuy Nguyen. Emotion Classification in Comments Across Multiple Fields. UIT - The Young Scientists and Researchers Conference, 2022. Project
The Legal Text Mining Project: Information Extraction from Legal Documents Jun 2022 - Nov 2022 NLP Engineer
Develop and implement NLP algorithms for legal text mining Collaborate with data science team to prepare textual data for NLP analysis Evaluate and refine NLP models to improve accuracy and performance Integrate NLP models into production software systems Stay up-to-date with latest NLP research and evaluate new techniques Communicate progress and findings to project team and stakeholders Multi-Domain Sentiment Analysis for Shopee Reviews Nov 2022 - NLP Engineer
Develop and implement web crawling scripts to collect Shopee review data from multiple domains/categories. Preprocess the collected data by cleaning and transforming it into a suitable format for sentiment analysis. Annotate the Shopee reviews data with sentiment labels (positive, negative, neutral) according to predefined guidelines. Communicate progress, challenges, and findings to the project team, and stakeholders, and potentially contribute to reports or documentation related to the project.
Skills
TOEIC 900+ (2023): .
Natural Language Processing with Probabilistic Models (2023): . Natural Language Processing with Classification and Vector Spaces (2023): . Problem Solving (Intermediate) (2023): .