Phan Quoc Dai Son
Thu Duc District, Ho Chi Minh City • ***********@*****.*** • 077******* • GitHub • LinkedIn Summary
Third-year Computer Science student at HCMUT interested in Artificial Intelligence, focusing on Computer Vision
(YOLO, OCR, CNN) and Speech Processing (ASR models such as Conformer and Whisper, and Text-to-Speech). Experienced in building end-to-end pipelines for vision-based applications and working with databases, including schema design and query optimization. Seeking internship opportunities in AI Engineering, with additional interest in Data Engineering to support scalable AI systems. Education
Ho Chi Minh City University of Technology (HCMUT - VNUHCM) Ho Chi Minh city, VietNam Bachelor of Computer Sience (English-medium Teaching and Learning Program) Sep, 2023 - Present GPA: 3.5/4.0
Experience & Projects
AI Challenge 2025 Competition Vietnam
Team Member – Computer Vision and OCR Track July, 2025 – September, 2025
• Collaborated in a 5-member team to design and implement an end-to-end computer vision pipeline integrating YOLO-based object detection and OCR for automated information extraction from images.
• Extracted video keyframes using PySceneDetect to generate image data for the detection and recognition pipeline.
• Developed search methods based on information extracted from OCR and BLIP-2, combining AI models to enable content-based image retrieval.
• Achieved a score of 54/87 in the qualification round of the competition benchmark. Receipt Recognition System VietNam
Team Member – Computer Vision August, 2025 – October, 2025
• Developed a receipt recognition pipeline using YOLO for detection and PaddleOCR for text extraction.
• Trained OCR models on receipt images to improve recognition performance. VETC InsureAssist – AI-based Claim Assistance System LotusHacks x HackHarvard x GenAI Fund Vietnam
Team Member – AI & Backend Developer Mar 2026
• Developed an end-to-end AI pipeline for insurance claim processing, including OCR-based data extraction, structured data processing, and AI-driven document verification.
• Designed document understanding workflows to extract and organize information from claim documents for downstream AI reasoning.
Cinema Web Application with Database System VietNam Team Leader - Backend Developer October 2025 - November 2025
• Led a team developing a cinema booking web application with MySQL database, Python FastAPI backend, and web frontend.
• Designed relational database schema and optimized SQL queries for movie scheduling, ticket booking, and user management.
• Implemented RESTful APIs with FastAPI (Python) to connect frontend services with backend database ope- rations.
Activities
HCMUT Machine Learning & IoT Lab Ho Chi Minh City, Vietnam AI Research Member June 2025 - Present
• Participate in research activities related to machine learning and artificial intelligence applications.
• Assist in implementing and experimenting with ML models for data analysis and AI-driven systems. Online Course
CS231n: Deep Learning for Computer Vision (Stanford University) Online Student January 2026 – Present
• Studying fundamental concepts of deep learning for computer vision, including CNN architectures, object detection, and image recognition.
• Implementing neural network models and experimenting with deep learning techniques for visual understanding tasks.
Getting Started with Deep Learning – NVIDIA Aug 2025 Certificate of Competency
• Completed NVIDIA course on deep learning fundamentals and GPU-accelerated AI. Skills & Interests
Programming Languages: Python, C++
Libraries & Frameworks: PyTorch, OpenCV, PaddleOCR, YOLO, FastAPI Tools & Technologies: MySQL, REST APIs, Git, Google Colab, HuggingFace Core Concepts: OOP, Data Structure & Algorithms, Computer Networks Language: English (IELTS 6.5 – 12/2022)