Hayoung Son, Data Scientist
California, *****, United States, +1-213-***-****, ********@***.***, www.linkedin.com/in/hayoung- sonLinkedIn
SUMMARY Dynamic Data Scientist with a solid foundation in data analysis and model development, backed by over 4 years of research experience. Proficient in utilizing advanced machine learning techniques, including BERT and GPT for synthetic data evaluation. Successfully led interdisciplinary projects and engaged in novel research on emotion analysis through YouTube comments. Adept at fostering collaboration and enhancing research methodologies, ready to leverage unique insights and technical skills to drive data-driven solutions for organizations.
WORK EXPERIENCE
01/2024 – 05/2024 Researcher, Leader, Data First Project, Center for Knowledge-Powered
Interdisciplinary Data Science
California, United States of America
Collaborated with Professor Yolanda Gil and Marjorie Freedman to spearhead the Data First Project, amplifying interdisciplinary research initiatives.
Executed comprehensive research on generating Synthetic Data using Large Language Models.
Engineered models to evaluate the quality of synthetic versus real data utilizing various fine-tuned BERT and GPT models.
10/2020 – 09/2023 Leader of KUMO Seminar, Professor Jun Murai's Research Group, KEIO UNIVERSITY
Tokyo, Japan
Operated under the mentorship of Professor Shigeya Suzuki and Professor Thamrin Achmad Husni to revamp research methodologies and accomplish project objectives.
Co-facilitated the KUMO-Bcali Joint Meeting: Reading Circle on Software, promoting collaborative discussions and enhancing participants’ comprehension of software topics.
Conducted research on YouTube Comments Emotion Analysis by deploying HuggingFace's KoBERT open-source model.
03/2022 – 07/2022 Teaching Assistant for Probability Macroeconomics 2, Keio University
Tokyo, Japan
Managed online class materials on the SOL Canvas Website, ensuring students had prompt access to essential resources. Fostered interactive communication with students, guiding active discussions that bolstered understanding and participation in class. Orchestrated class schedules and assignment deadlines while ensuring the accurate upload of information on the SOL Canvas Website. EDUCATION
01/2024 – Present University of Southern California Masters, Applied Data Science
California, United States
Coursework includes Machine Learning for Data Science, Database Systems, Foundations of Data Management, Predictive Analysis. 09/2019 – 09/2023 KEIO UNIVERSITY
Bachelor of Arts, Environment and Information Studies Tokyo, Japan
Coursework includes Algorithm Science, Heuristic Computing, Fundamentals of Object-Oriented Programming, and more. SKILLS Python R
Java JavaScript
HTML/CSS MySQL
SQL Pandas
NumPy Scikit-learn
PyTorch Streamlit
Selenium MongoDB
Firebase NoSQL
AWS Git
Jupyter Notebook Google Colab
Excel Relational Databases
Data Visualization Data Analysis
Machine Learning Statistical Modeling
Collaboration Communication
Tech Industry Experience Automation
Power BI UX
Adaptability
LANGUAGES English (Native) Korean (Native)
Japanese (Intermediate) Chinese (Beginner)