Nguyễn Hữu Tâm
Data Engineer Intern
Birth of day : 01/04/2005
Sex: Nam
SĐT: 092*******
Email: *********@*****.***
EDUCATION
**** - **** **** ********** ** Technology and Engineering Data Engineering
GPA: 3.1
SUMMARY
Data Engineering student seeking an internship opportunity to apply knowledge in Python, Java, databases, Hadoop and Spark. Familiar with ETL/ELT processes and data processing concepts, with strong learning ability, responsibility, and enthusiasm for developing practical skills in a professional environment. TECHNICAL SKILLS
Programming Language: Python, Java
Databases: MySQL, PostgreSQL,SQL Server
Data Engineering: Apache Spark, Apache Pig, Apache Superset, Power BI Data Acquisition & ETL: ETL/ELT Pipelines
Developer Tools: Git, GitHub, Docker
PROJECTS
Analysis Weather Brazil Project (Feb 2026 – May 2026)
Designed and implemented a Big Data weather analytics platform using Apache Spark, Apache Iceberg, and AWS S3 to process over 5GB of weather data (40M+ records) collected hourly from weather stations across Brazil.
Built a Lakehouse architecture (Bronze–Silver–Gold) for scalable and maintainable data processing, supporting incremental ingestion and enabling reliable handling of hundreds of thousands of records per processing cycle.
Developed distributed ETL pipelines using Apache Spark for weather data ingestion, cleansing, transformation, and aggregation, processing over 5GB of weather data and enabling scalable batch analytics through parallel data processing
Applied data quality validation with Great Expectations and managed metadata using AWS Glue Catalog.
Visualized regional weather patterns and analytical dashboards using Tableau, enabling analysis of temperature, rainfall, and climate trends across different regions of Brazil. Crypto-ETL-Warehouse Project (Feb 2026 – Present)
Developed an end-to-end crypto data warehouse pipeline using Apache Spark, Airflow, PostgreSQL, and Power BI.
Built ETL workflows to collect cryptocurrency market data from APIs, then transform and load data into a warehouse.
Implemented raw and staging layers for data cleaning, preprocessing, and aggregation.
Automated scheduling and monitoring of ETL jobs using Apache Airflow.
Designed warehouse schemas and analytical tables for daily crypto market analysis, supporting trend evaluation and comparative analysis of cryptocurrency performance. Netflix Data Analytics Project (Nov 2024 – Dec 2024)
Developed a Netflix Data Analytics project using Python, Pandas, and NumPy for data cleaning, preprocessing, and statistical analysis.
Performed EDA (Exploratory Data Analysis) to analyze genres, ratings, release trends, and userrelated insights.
Built an interactive desktop interface using Python GUI libraries for data visualization and filtering.
Implemented CRUD operations, data transformation, and reporting features for efficient dataset management.
Created charts and visual reports to present analytical insights and trends from the Netflix dataset. CERTIFICATE
ENGLISH: Basic
AWS Academy Graduate Cloud Foundations
AWS Academy Graduate Cloud Web Application Builder