Mai Thanh Hoang
Data Engineer
**************@*****.*** — +84-886***-***
github.com/Haongg
SUMMARY
Aspiring Data Engineer with a strong foundation in Computer Science, focused on building scalable data pipelines and real-time streaming systems. Experienced with Spark, Kafka, and modern data stack tools. Committed to clean code, data reliability, and continuous technical growth.
EDUCATION
Ho Chi Minh City University of Technology (HCMUT) 2023 – Present
B.S. in Computer Science
GPA: 3.4/4.0
PROJECTS
DDoS Detection System Jan 2026 – Mar 2026
Description: Developed a high-throughput system for real-time DDoS detection by analyzing network traffic logs, enabling immediate visibility into potential threats.
Tech Stack: PySpark Streaming, Kafka, Elasticsearch, Redis, Docker, Grafana
Outcome: Achieved sub-2-second detection latency while processing thousands of network events per second.
Project Link: github.com/Haongg/DDos_Detection
NYC Taxi Data Warehouse Dec 2025 – Mar 2026
Description: Built an end-to-end data warehouse pipeline that automates ingestion and transformation and produces analytics-ready data marts for KPI dashboards.
Tech Stack: Apache Airflow, PySpark, PostgreSQL, MinIO, dbt, Docker, Metabase
Outcome: Automated batch workflows and improved data availability for analytics and reporting.
Project Link: github.com/Haongg/NYC-Taxi-Datawarehouse
SKILLS
Programming: Python (PySpark, Pandas, NumPy), SQL (MySQL, SQL Server, Oracle)
Data Engineering: Apache Spark, Kafka, Airflow
Databases: MySQL, SQL Server, Elasticsearch, Redis
Tools: Docker, Git, Grafana
ADDITIONAL
Languages: English (TOEIC 680), Vietnamese (Native)
Soft Skills: Problem-solving, Teamwork, Communication, Time Management, Adaptability