NGUYỄN THÀNH LONG
**.**.**.**** ***************@*****.*** linkedin.com/in/long-tyrant-z777z
github.com/bananacat12
OBJECTIVE
Computer Science student concurrently pursuing a Master’s in Artificial Intelligence, with a focus on deep learning, computer vision, and multimodal learning. Passionate about applying Python, PyTorch, and advanced AI techniques to real-world challenges. Experienced in leading and deploying AI projects such as Visual Question Answering, Neural Machine Translation, and Image Captioning. Seeking an opportunity to contribute to impactful AI systems while gaining hands-on experience in a collaborative, innovation-driven environment.
TECHNICAL SKILLS
• Programming Languages:Python, Java, JavaScript
• Machine Learning/Deep Learning:TensorFlow, PyTorch, Scikit-learn, Keras, HuggingFace Transformers
• Computer Vision:OpenCV, CNNs, YOLO, Image Segmentation, Visual Question Answering
• Natural Language Processing:Transformers, BERT, GPT, Text Summarization, Sentiment Analysis
• Data Science & Visualization:Pandas, NumPy, Matplotlib, Seaborn, SQL, Tableau
• Tools & Platforms:Git, Linux, Google Colab, Jupyter
• Frameworks & Libraries:Streamlit
EDUCATION
• Ton Duc Thang University (TDTU) Ho Chi Minh City, Vietnam Master of Science in Computer Science 2024 – Present
• Ton Duc Thang University (TDTU) Ho Chi Minh City, Vietnam Bachelor of Science in Computer Science 2022 – 2026 PROJECTS
• Visual Question Answering (VQA) System 2025
Tools: Python, PyTorch, COCO-VQA Dataset, CNN+Transformer [§]
Designed a deep learning pipeline combining CNN and Transformer architectures to answer natural language questions about images
Achieved 78% accuracy on the COCO-VQA dataset, surpassing baseline models by 12%
Optimized model inference speed by 25% using mixed-precision training
• Neural Machine Translation with GPT 2025
Tools: HuggingFace Transformers, PyTorch, Vietnamese-English Dataset [§]
Fine-tuned a GPT-3 model for Vietnamese-to-English translation, achieving a BLEU score of 32
Preprocessed a dataset of 50,000 sentence pairs, improving model robustness with data augmentation
Deployed model using FastAPI, enabling real-time translation for 100+ users
• Image Captioning System 2025
Tools: Python, CNN + LSTM, Transformer, PyTorch, Flickr8k Dataset [§]
Developed an encoder-decoder model using CNN and LSTM to generate descriptive captions for images
Achieved a BLEU-4 score of 0.65 on the Flickr8k dataset
Integrated model into a web app using Streamlit for real-time demonstrations
• OCR System for Scanned Documents 2025
Tools: OpenCV, CNN, Transformer, Python [§]
Built an OCR pipeline for extracting text from scanned documents
Applied image preprocessing techniques (e.g., binarization, noise reduction) to improve text recognition
Automated processing of 1,000+ documents, reducing manual effort by 80%
• Real Estate Price Prediction 2024
Tools: Pandas, Scikit-learn, XGBoost, Random Forest, Linear Regression, Decision Tree [§]
Developed a Random Forest regression model to predict resale HDB prices in Singapore, achieving an R2 score of 0.89
Applied SHAP to interpret predictions, highlighting floor area and lease remaining as key factors
Processed 10,000+ listings with ‘ColumnTransformer‘, including imputation, scaling, and encoding
• Top-K High Utility Itemset Mining 2024
Tools: Python, Heapq, Data Mining, Uncertain data, Top-K high-utility itemsets mining, Interactive mining [§]
Built a ‘Node‘ structure representing itemsets and their associated transaction subsets to support utility mining
Implemented a Top-K High Utility Itemset Mining algorithm using depth-first search (DFS) and pruning strategies
Simulated uncertain databases and visualized performance trends using Matplotlib
Applied team-based rule to determine the project topic by hashing member names to topic index OTHER SKILLS
• Languages:Vietnamese (Native), English (B2 Aptis - CEFR)
• Soft Skills:Leadership (Project Leader), Problem-solving, Teamwork, Agile Workflow, Research & Technical Writing
• Interests:Football, Table Tennis, Badminton
CERTIFICATIONS
• B2 Aptis English Certificate – British Council 2024
• Advanced Computer Vision with TensorFlow – Coursera 2025
• Neural Networks and Deep Learning – Coursera 2025
• Python for Data Analysis: Pandas & NumPy – Coursera 2025