Post Job Free
Sign in

Data Science Machine Learning

Location:
Quan 1, 71000, Vietnam
Posted:
October 08, 2024

Contact this candidate

Resume:

Bui Dinh Bao +84-090*******

Data Science Student § github.com/fabyanbui

Final-year ï linkedin.com/in/fabyanbui

Ho Chi Minh City University of Science # *******.*****@*****.*** As a final-year Data Science student with a strong passion for technology and scientific advancement, I am eager to apply my problem-solving skills in a dynamic internship environment. I am seeking an internship opportunity in Data Science, Data Analysis, Python development, or AI to complement my academic studies and contribute significantly to innovative projects. My goal is to enhance my skills and pave the way for advanced studies and impactful research.

Education

VNUHCM - University of Science Expected Graduation October, 2025 Bachelor in Data Science, Faculty of Information Technology Current GPA: 3.71/4.00 Relevant Coursework

Data Visualization Data Mining Data Analysis Statistical Learning Machine Learning Introduction to Artificial Intelligence Database Introduction to Data Science Programming for Data Science Probability and Statistics Computational Statistics and Applications Data Structures & Algorithm Object-Oriented Programming Skills

• Programming languages: Python, C/C++, R.

• Knowledgeable in Power BI, Tableau, Jupyter Notebook, SQL, Git&GitHub and LATEX.

• Experienced in using Python from data crawling to data preprocessing, visualization and analysis.

• Strong in logical thinking, mathematics and problem-solving.

• English Proficiency: TOEIC 4 skills (Listening 395, Reading 385, Speaking 130, Writing 170). Class Projects

Analysis and Dashboard for Monthly Air Passengers in America May 2024 - Jun 2024 Data Visualization Course - Group Project (Team Size: 5)

• Technology:

Language: Python (Jupyter Notebook).

Libraries: numpy, pandas, matplotlib, seaborn, plotly, statsmodel.

Visualization and Dashboard Creation: Tableau.

• Description: In this project, students are tasked with studying and applying data visualization techniques specifically tailored for time series data. The primary learning objectives include gaining proficiency in identifying, analyzing, and visualizing temporal datasets to uncover trends, patterns, and anomalies over time, then using Data Visualization Tools like Tableau or Power BI to support interaction.

• Role: Visualize 2/5 charts corresponding to 2 sheets in official Tableau Dashboard. Make a video of presentation for the project. Also, understanding and being able to undertake all the phases of the data project, include preprocessing and machine learning tasks. (§ click to view) Learning Agency Lab - Automated Essay Scoring 2.0 May 2024 - Jun 2024 Data Analysis Course - Group Project (Team Size: 6)

• Technology:

Language: Python (Jupyter Notebook).

Libraries: numpy, pandas, sklearn, xgboost, lightgbm.

Text processing and feature extraction.

• Description: This is a famous Kaggle Competition. The goal is to train a model to score student essays. Using given tabular data, students’ efforts are needed to reduce the high expense and time required to hand grade these essays. In order to do that, apply machine learning and fine tune model appropriately.

• Role: Understand and preprocess data (include handling text data and extracting features). Build and tune the main model. Optimize the prompting engineering model. To be honest, I am the team leader, the key member of this project. (§ click to view)

Text Summarization Application in Transformer Model Architecture Jun 2024 - Jul 2024 Statistical Learning Course - Individual Project

• Technology:

Language: Python (Jupyter Notebook).

Libraries: pyngrok, torch, transformers, datasets, streamlit.

Fine tuning LLM on a specific dataset.

• Description: Developed a practical application leveraging the Transformer architecture, which underpins ChatGPT. The project focused on utilizing the self-attention mechanism of Transformers to achieve superior performance in natural language processing tasks.

• Role: Identifying the model and the dataset. Choosing appropriate metrics for computing loss function. Tokenize data and fine tune the model. Evaluate and deploy the model to a web application. (§ click to view)

Social Activities

• Participated in Spring Volunteer Campaign 2024.

• Participated in Green Summer Volunteer Campaign 2024.



Contact this candidate