Post Job Free
Sign in

Data Pipeline Etl

Location:
Quan 1, 710000, Vietnam
Salary:
4-500000
Posted:
September 24, 2023

Contact this candidate

Resume:

q

q

**** - ****

**** - *******

/

Powered by

SUMMARY

Final-year Computer Science student at the University of Information Technology, specializing in data engineering. I am currently seeking practical experience to develop myself as a data engineer.

BASIC SKILLS

Programming languages

Python C/C++ Javascript

DevOps

Git Docker

Big data tech

Spark Prefect Airflow ETL/ELT Kafka

PROJECTS

Twitter ETL Data pipeline

https://github.com/nhattan040102/Twitter_data_pipeline A basic ETL pipeline concept project which collects realtime tweets from the Twitter API mentioning or from six selected world leaders MongoDB: served as the project's Data Lake, where raw tweets collected through the Tweet Collector are stored for initial storage and preservation Postgres SQL: served as the Data Warehouse, where processed tweets are stored after sentiment analysis and timestamping. Docker: the project utilizes Docker for containerization, enabling seamless deployment and scalability of components within the ETL pipeline. Twitter API: the pipeline integrates with the Twitter API to continuously gather tweets related to world leaders in real-time, ensuring a steady influx of data.

Spotify ETL Data pipeline

https://github.com/nhattan040102/Spotify_data_pipeline A data pipeline that transforms raw data from the Spotify API into a structured and query-ready dataset, enabling insightful data analysis and visualization

Airflow: Apache Airflow played a pivotal role in orchestrating and automating the ETL (Extract, Transform, Load) processes, ensuring data was efficiently retrieved, transformed, and loaded into MongoDB. MongoDB: MongoDB served as the project's database solution, providing a flexible and scalable storage system for housing the structured Spotify data, making it readily accessible for analysis and reporting. Spotify API: served as the primary source for retrieving user streaming data, which was then processed for analysis and visualization within the ETL data pipeline.

EDUCATION

High School Diploma

Trường THPT số 2 An Nhơn

Bachelor of Computer

Science

Trường đại học Công nghệ

Thông tin

GPA

8.48 10

STRENGTHS

Problem-solving

Good at overcoming drawbacks and solving

problems in an innovative and fast way.

Curiosity

Learning is a never-ending process. Consider

myself a person with a burning desire to know

and understand concepts and ideas.

www.enhancv.com

E

q e

TRAN NGUYEN NHAT TAN

Data Engineering Intern

089******* *******.******@*****.***

https://github.com/nhattan040102 Linh Trung, TP.Thủ Đức



Contact this candidate