CONTACT
SKILLS
LANGUAGES
nguyenthixuanthanh17102
003.com
Dong, District Binh Tan
Program Languages
Python
C#
C/C++
Java
HTML/CSS/JS
SQL Sever
Other
Google Cloud/ AWS/ Hadoop
PowerBi
N8N - AI Agent
English (Intermediate)
French (Basic)
XUAN THANH NGUYEN
DATA ENGINEER FRESHER
PROJECT
PROFILE
"Final-year Information Technology student with a strong foundation in databases, ETL pipelines, and data modeling. Experienced with SQL, Python, and cloud platforms like Google Cloud and AWS through academic projects. Eager to apply my technical skills in a professional data engineering environment and contribute to building scalable data infrastructure.."
Student Academic Performance Analysis 2022 - 2023
Description:
Built a data pipeline to analyze students' academic performance based on multiple indicators (grades, attendance, behavior, etc.). Responsibilities:
Collected and cleaned data from multiple sources (CSV, Excel) using Python and Pandas
Designed and implemented an ETL workflow to transform raw data into structured formats
Used SQL to query and aggregate performance metrics Visualized insights using Power BI for better decision-making in academic planning
Technologies: Python, Pandas, SQL, Power BI
Clothing Store Product Data Analysis & Visualization Description:
Developed a system to process, analyze, and visualize sales and inventory data from a fashion retail website.
Responsibilities:
Built data ingestion pipelines to collect product and transaction data from the website backend
Cleaned and normalized datasets for consistent schema Created a dashboard to track product performance, trends, and customer preferences
Enabled filterable visualizations by category, brand, and time period Technologies: SQL Server, Google Cloud, Power BI, ASP.NET Core 2023 - 2024
EDUCATION
Information technology - Computer science
HIU - Hong Bang International University
2021 - 2025
Cardiovascular Disease Data Cleaning & Analysis
Description:
Performed in-depth data cleaning and preprocessing for a health dataset related to cardiovascular disease prediction. Responsibilities:
Identified and handled missing, duplicate, and inconsistent data entries Applied data transformation techniques (scaling, encoding, normalization)
Conducted statistical analysis to understand feature correlation Prepared clean datasets for use in predictive modeling Technologies: Python, Pandas, NumPy, Matplotlib, Seaborn, Google Colab 2024 - 2025
Stock Market Analysis and Evaluation
Description:
Conducted a data-driven analysis of stock market trends to evaluate the performance and volatility of selected companies over time. Responsibilities:
Collected historical stock data using APIs and public datasets (e.g., Yahoo Finance, Kaggle)
Processed and cleaned data to remove anomalies and prepare for analysis
Calculated financial indicators such as moving averages, volatility, and ROI
Visualized stock trends and risk levels using interactive dashboards Summarized insights to support investment strategy recommendations Technologies: Python, Pandas, NumPy, Matplotlib, Seaborn, Plotly, Google Colab
2023 - 2024