Post Job Free
Sign in

Data Engineer

Location:
Quan 1, 71000, Vietnam
Posted:
November 26, 2024

Contact this candidate

Resume:

Nguyen Hoang Thinh

Ho Chi Minh City, Viet Nam

+84-854-***-*** ****************@*****.***

EDUCATION

University of Science, VNU – HCMC (HCMUS) Oct 2020 - Jun 2024 Bachelor of Science in Computer Science

● GPA: 3.58 / 4.0

● Relevant Coursework: Data Structures and Algorithms, Object Oriented Programming, Introduction to Databases, Introduction to Big Data, Introduction to Data Science, Introduction to Machine Learning, Programming for Data Science, Data Mining and Applications, Intelligent Data Analysis, Deep Learning for Data Science, Parallel Programming EXPERIENCE

TMA Solutions Apr 2024 - Jun 2024

Data Analyst Intern

● Researched usage of Docker, ELK Stack (Elasticsearch, Logstash, Kibana)

● Resolved a Data Science problem

o Preprocessing data

o Visualizing data

o Building model (Classification)

OFFICIENCE Jun 2024 - Aug 2024

Fresher Data Engineer

● Implemented crawling data (Bash script, Awk, Python)

● Worked on Linux OS, interacted with database (MySQL) and server

● Created and scheduled jobs running automatically on Jenkins SKILLS

● Programming:

o Python, C/C++, HTML/CSS, Java

● Techniques:

o Data scraping: Python (Request, Beautiful Soup, Selenium, Scrapy), Bash Script, Awk o Databases: MS SQL Server, MySQL, MongoDB

o Analysis/Visualization: Python (Numpy, Pandas, Matplotlib, Seaborn), PowerBI o ML/DL: Python (Scikit-learn, Pytorch, Tensorflow) o Big Data: Hadoop, Spark

o Automation: Jenkins, Airflow

o Other: Github, Linux, Streamlit

● Soft skills:

o Teamwork and effective collaboration, analytical thinking and problem-solving skills, ability to quickly adapt to new environments

PROJECTS

NLP WITH DISASTER TWEETS Link Github Apr 2023 - Jun 2023 Main Problem: Predicting natural disasters based on statuses on Twitter

● Tasks: Data preprocessing, Data modeling with transformer models (BERT, RoBERTa) and Model evaluating

● Programming Techniques: Python Numpy, Pandas, Scikit-learn, Pytorch, Tensorflow MEN’S FASHION SHOPEE ANALYSIS Link Github Oct 2023 - Jan 2024 Main Problem: Analyzing competitors in the men’s fashion on Shopee

● Tasks: Data scraping, Data storing, Data preprocessing, Data analyzing/visualizing

● Programming Techniques: Python Request – Selenium, MongoDB, Numpy, Pandas, Scikit-learn, PowerBI

MOVIE RECOMMENDATION SYSTEM Link Github Mar 2024 - Jun 2024 Main Problem: Building a movie recommendation system based on various methods

● Tasks: Data scraping, Data storing, Data analyzing/visualizing, Data modeling (Softmax Deep Neural Network), Web demo building

● Programming Techniques: Python Request – Beautiful Soup, MongoDB, Numpy, Pandas, Matplotlib - Plotly, Scikit-learn, Tensorflow, Streamlit KNOWLEDGE

HADOOP KNOWLEDGE Link Github Jul 2023 - Aug 2023

● Tasks: Hadoop installation, Data storing in Hadoop, MapReduce program

● Programming Techniques: Java Hadoop, Linux

PYSPARK KNOWLEDGE Link Github Jul 2023 - Aug 2023

● Tasks: Data query, RDD-based manipulation, data mining

● Programming Techniques: Python Spark

SQL KNOWLEDGE Link Github Feb 2024

● Tasks: Query authoring (Basic, nested, advanced query,…), Stored Procedure, Function, Trigger

● Techniques: MS SQL Server

HONORS & AWARDS

GOLD MEDAL, 4

th

HCMC APRIL OLYMPIC (MATH) Apr 2018

BRONZE MEDAL, 5

th

HCMC APRIL OLYMPIC (MATH) Apr 2019

LANGUAGES

● Native in Vietnamese

● Intermediate proficiency in English

LINKS

● Github / HThinhN

● Github / HThinhZ

● Linkedin / Nguyen Hoang Thinh

● Facebook / Nguyen Hoang Thinh



Contact this candidate