Ho Chi Minh,Viet Nam
********************@*****.***
Nguyen Huy Bao
Data Science Intern
https://github.com/nguyenhuybao1108?tab=repositories
Experience
1.IT audit Intern at PwC
ITGC for yuanta,CFYC.
Data and analytics for suntory pepsico marketing.
Time-sheet charge visualization for internal firm using python and power bi.
2.Teaching assistant at english center (vinalearn).
Prepare materials before class
Convey teacher’s lesson to student
Liaise with the student’s parents about student’s performance.
3.Teaching assitant in courses like probability and random process, regression analysis, statistical methods.
help learner can see the big picture of the course after that the concept behind each equation.
Skill
Languages :
Python (pandas,numpy,scipy,statsmodel,plotly,matplotlib,seaborn,sklearn)
R
Java
SQL (good at writing subqueries and window function)-> I did write a complicated query to find winstreak of football club ( https://github.com/nguyenhuybao1108/winstreak )
Html,css,javascript ( build the visualization dashboard )
Good knowledge in Linear and Logistic regression,KNN,kmeans,dbscan,SVM,Decision Tree,Random Forest,Naïve Bayes,… (I know how to tune these models) -> KNN I use it for fill in missing value beside other basic imputation method,Decision tree for classification and I also use it for feature engineering when binning,…
Projects
Data Wrangling with R.
Use pyspark to EDA, Preprocess data(fill missing value,handle outlier,scale data,onehot encoder), crossvalidation.
Build the restaurant database, and some advance query to extract insight.
Build the interactive chart for car sales dataset using d3 js.(I can also use visualized tools like Tableau,Power BI)
Education
Senior in VIETNAM NATIONAL UNIVERSITY HCMC INTERNATIONAL UNIVERSITY
Activities
Build the credit scoring to evaluate the business then have action to give or reject the business loan.
Web scrape using Beautiful Soup and Selenium.
Design a basic chatting system ensure that the message is confidentiality and integrity combining encryption and HMAC.
ETL process from raw file to mysql.( https://github.com/nguyenhuybao1108/importcsvtomysqldb )
Now I am find an interest in time series analysis
Hobbies
I am enthusiatic with football