Post Job Free
Sign in

Information Technology Machine Learning

Location:
Binh Duong, Vietnam
Posted:
September 26, 2024

Contact this candidate

Resume:

NGUYEN TAN DUNG

+843******** # ************************@*****.*** § github.com/Dunglqd ï linkedin.com/in/tandunglqd Education

VNUHCM - University of Information Technology (UIT) Expected May 2025 Bachelor Degree in High-Quality Information Technology Programs (GPA: 3.27 / 4.00) Ho Chi Minh

• IELTS: 5.5

• JLPT: N3

Projects

Chatbot system for UIT Admissions ChromaDB, LLM, Generative AI, Prompt Engineering, Llama-index, API, MongoDB, BM25

• Leverage RAG (Retrieval-Augmented Generation) with hybrid search (Vector and Keyword) to gather and classify admission information for UIT.

• Design prompts for the generation module and integrate modules such as retrieval, classification, and reranking to provide accurate responses.

• Implement a vector database with ChromaDB and additional databases with MongoDB.

• Build the user interface using Streamlit and connect the backend via FastAPI.

• Technologies used: Llama-index, Langchain, MongoDB, ChromaDB, Redis, FastAPI, Pydantic, Streamlit, Chainlit, Transformers, Torch, Sentence Transformers, Selenium, BeautifulSoup4, Pandas, Underthesea, VNCoreNLP, Lingua DDoS Attack Detection System Machine Learning, Kafka, Spark Streaming, TensorFlow

• Leverage Machine Learning with hybrid models (XGBoost, RBF-SVM, Decision Tree, and DNN) to detect DDoS attacks using Kafka streaming data.

• Design and implement real-time data processing and feature extraction using Apache Kafka and Spark Streaming.

• Deploy models using TensorFlow/Keras for analyzing network traffic and predicting DDoS attacks.

• Evaluate the performance of each model with metrics like accuracy, precision, recall, and F1 score, achieving up to 99.18

Vietnamese Sentiment Analysis System ChatGPT, BERT, ViBERT, SVM

• Integrate ChatGPT for Vietnamese sentiment analysis alongside traditional models like BERT, ViBERT, and SVM.

• Process and classify Vietnamese text data using natural language processing techniques.

• Evaluate model performance with metrics like accuracy, precision, recall, and F1 score, with notable improvements in sentiment prediction accuracy.

• Implement feature extraction and text preprocessing methods tailored for Vietnamese language specifics.

• Develop a pipeline combining machine learning and deep learning models for robust sentiment analysis of Vietnamese text data.

Real-time Toxic Comment Analysis From Live YouTube Videos Pytchat, Kafka, Apache Spark, PhoBERT, Docker

• Build an application for analyzing sentiment of toxic comments on live YouTube videos.

• Use Pytchat for comment extraction, Kafka and Spark for data processing, PhoBERT for classification.

• Train the model on the ViHSD dataset (Vietnamese Hate Speech Detection).

• Achieve 85.34 percent accuracy in comment classification.

• Develop a web application to visualize the analysis results. Hotel Management System SQL, C, 3-Layer Architecture

• Develop hotel management software with basic features: booking, check-out, reporting.

• Define business requirements using forms and regulations, then model them with data flow diagrams.

• Design the system with 3-layer architecture, database, and user interface, then implement and test core functionalities.

Achievement

• Top 5 in Bosch Coderace Challenge 2023 - Exceed the Limitless Mind.

• Certificate ”Sinh Vien 5 Tot” by The University of Information Technology in 2022, 2023 Publication

• Exploring the Performance of ChatGPT for Vietnamese Sentiment Analysis, UIT Young Scientists and Fellows Conference, 2023. Dai Nguyen Ba, Nguyen Tan Dung, Dang Van Thin. Technical Skills

Generative AI • Prompt Engineering • API • Text Mining • Tensorflow • Pytorch • Problem Solving • NLP • Git and Github • Docker • MLOps • SQL



Contact this candidate