Trung-Kien Nguyen
088*******-**/**/**** *****.****@*****.*** github.com/kiendoo4 linkedin.com/in/kiendoo4/ Summary
About me Driven by a desire to unlock the potential of human language, I specialize in developing cutting-edge NLP applications. Committed to continuous learning and adaptation. Carrer objective Currently seeking a position as an AI Engineer or Research Assistant. Education
University of Information Technology - UIT Sept 2021 - Expected June 2025
• Bachelor of Computer Science, Mass Program, 3rd year completed.
• Coursework: Natural Language Processing, Information Retrieval, Introduction to Software Engineering, . . . Project and my contribution
Ad-hoc Document Retrieval for Vietnamese Newspapers Oct 2023 - Dec 2023 Technologies used: Python, PhoBERT, Vietnamese-SBERT Collaborators: HatakaCder, hungnv2003bta
• Developed a data retrieval pipeline to implement a hybrid information retrieval system combining tra- ditional keyword matching (TF-IDF, BM25) with the contextual understanding capabilities of pre-trained language model. Two corpora of about 50,000 Vietnamese news articles were collected via web crawling techniques for demonstration and evaluation purposes.
• An experiment was conducted to compare the performance of different methods. Our results indicated that the combination of TF-IDF & Vietnamese-SBERT significantly outperformed competing approaches, attaining an nDCG@10 score of 95.82% from our test set.
• Github: https://github.com/HatakaCder/VNnSE Flask/ Student Management Sept 2023 - Dec 2023
Technologies used: C#, XAML, SQL, Entity Framework, Git Collaborators: HatakaCder, AnTran210, LuongDaiPhat
• Proficient in C# and XAML for developing a software application for efficient school management.
• Established user roles (admin, teacher), designed a SQL-based database for CRUD operations and employed Entity Framework to optimize data access and manipulation between the database and C# application.
• Leveraged Git expertise to integrate individual contributions into a cohesive project repository.
• Github: https://github.com/kiendoo4/StudentManagement Document-Specific Retrieval Augmented Generation May 2024 - June 2024 Technologies used: Python, LLM (Gemini-pro), LangChain, SBERT, FAISS Collaborator: HatakaCder
• Converted Wikipedia articles into dense vector representations using SBERT - a pre-trained language model to facilitate similarity search and implemented FAISS for efficient storage and retrieval of these embeddings.
• Employed a LLM (Gemini-pro) to generate answers by processing a subset of the most pertinent articles.
• Implemented RAG pipeline using LangChain, incorporating document loading, embedding, vector store creation and retrieval to enhance question answering capabilities.
• Github: https://github.com/kiendoo4/DocumentSpecificRAG Skills
Technical skills
• Adept in data engineering and machine learning techniques, I am equipped to transform raw data into actionable insights and implement innovative machine learning solutions.
• Programming languages & Technologies: Python, C++, C#, XAML, Git, SQL, LATEX, . . . Soft skills
• Language: Proficient in English with IELTS 6.5 (TRF number: 23VN012469NGUT028A).
• Leadership: Team leader of all of the projects above, I led and coordinated all project phases.
• Presentation & Communication: Team presenter for all aforementioned projects. Skilled in creating engaging visual content through LATEXto enhance presentation impact.