Post Job Free
Sign in

Data Engineer Big

Location:
Quan 12, 71500, Vietnam
Posted:
October 20, 2024

Contact this candidate

Resume:

**/****: Top * in University Data Science Competition

Data Science 2021 - 2024

Nguyen Tat Thanh University

I am seeking a Data Engineer position where I can apply my knowledge of big data processing, pipeline development, and data optimization solutions to help businesses streamline their processes and make data-driven decisions. I am constantly learning new technologies like Big Data, Cloud Computing, and Machine Learning to enhance my personal skills. LUONG CONG THUAN

An Phu Dong, District 12, Ho Chi Minh City 039******* ***************@*****.***

DATA ENGINEER INTERN/FRESHER

Programming Languages: Python, SQL

Tools & Platforms: Apache Spark, Hadoop, Airflow, Mage Databases: MySQL, PostgreSQL, BigQuery

Cloud: AWS (S3, Redshift, EMR), Google Cloud Platform (GCS, BigQuery) Data Warehousing: Data Modeling, ETL Pipelines, OLAP, OLTP Data Visualization: Tableau, Power BI, Looker

Other Technologies: Terraform, Docker

SKILLS/ TECH

EDUCATION

AWARDS & ACHIEVEMENTS

CAREER OBJECTIVE

Montgomery County of Maryland - Warehouse and Retail Sales September 2024 GitHub: Retail Sales Project

Built ETL infrastructure on GCP using Terraform, configured GCS and BigQuery. Deployed Mage to automate ETL processes, collecting and transforming data from CSV. Stored data in Parquet format in GCS, processed it locally with Spark. Modeled data in a star schema and loaded it into BigQuery for OLAP analysis. Used LookerStudio for dashboard data visualization. PROJECTS

Strong teamwork skills

Adequate problem-solving skills

Good English reading and comprehension, conversational level SOFT SKILLS

Smart city - IoT Data Processing System with Real-time Streaming October 2024 GitHub: IoT Data Processing/Streaming System

Collected real-time IoT data during a vehicle journey from Ho Chi Minh City to Da Lat. Built a real-time IoT data pipeline using Docker, Apache Kafka, and Apache Spark. Integrated AWS S3, Glue, Athena, and Redshift for data storage and querying. Utilized DBeaver for Redshift management and data processing



Contact this candidate