Dayou (Wally) Wu
********@********.*** ***** NE *0th St, Bellevue, WA 217-***-**** LinkedIn
SUMMARY
Actively seeking full-time Data Scientist positions starting July 2024. Self-motivated, data-driven engineer with hands-on experience in Data Engineering, Software Development, and Machine Learning Research.
Programming Skills: Python, SQL, Tableau, Power BI, PyTorch, AWS, Hugging Face, GCP
Database Skills: MySQL, PostgreSQL, AWS OpenSearch, AtlasDB, MongoDB, Hadoop, Spark, Neo4j
EDUCATION
University of Illinois at Urbana-Champaign May 2024
Master of Computer Science (Aug 2023 – May 2024) Average GPA: 3.70
Bachelor of Science in Computer Science (Aug 2019 – May 2023) Average GPA: 3.69
INTERNSHIP EXPERIENCES
UIUC Coordinated Science Laboratory Urbana-Champaign, Illinois
Research Assistant (Temporal Reasoning in LLMs) May 2023 – Present
Analyzing ideology influence pathways among over 420 thousand Twitter users with a Graph Convolutional Network, securing a 40 million USD investment
Crawling post information across multiple platforms through Selenium-automated requests and streamlining it into PostgreSQL for subsequent neural network construction
Developing a front-end visualization of ideology heatmaps and time-series data in React.js
Cleaning a noisy, informal Twitter dataset and identifying events with a modified TF-IDF filter
Visa Inc. Austin, Texas
Full Stack Data Engineer Intern May 2022 – Aug 2022
Developed and implemented automated Python scripts to troubleshoot pre-maintenance issues in a Hadoop data system, reducing database maintenance time from several days to mere minutes
Engineered SQL tables that organize different workflows into separate tables for query optimization
Composed automated emails using HTML, CSS, and Python to report results
Collaborated across departments to demonstrate work achievements and coordinate data permissions
PROJECT EXPERIENCES
(ML+Computer Vision) PawMeme, Automated Pet Meme Generator Urbana-Champaign, Illinois
Data Engineer, Machine Learning Engineer Jan 2024 – May 2024
Created an automated meme generator for pet photos based on a Convolutional Neural Network, with 87.8% accuracy in pet sentiment analysis, outperforming the state-of-the-art Facial Expression Recognition models
Implemented object detection mechanisms to capture pet photos in videos
Integrated the algorithm into a Raspberry Pi device for pet monitoring and automatic meme generation
(ML+NLP – Workflow Automation) CAII 2024 Ashby AI Hackathon Urbana-Champaign, Illinois
Full-Stack Software Engineer, LLM Researcher Apr 2024 – Apr 2024
Built a Workflow Automation tool leveraging LangGraph, enabling Large Language Models (LLMs) to autonomously manage workflows, winning second place in the Ashby AI Hackathon
Fine-tuned an unsupervised LLM reasoning model through zero-shot training and testing
Evaluated the operational efficiency of each component using LangSmith metrics and optimized low-performing components through parallelization, achieving a 90% reduction in execution time
(NLP – User Behavior Prediction) IEEE INCAS Competition Urbana-Champaign, Illinois
LLM Model Developer Aug 2023 – Jan 2024
Achieved a 64% performance improvement over the baseline model by predicting 300,000 user engagement pairs in hashtag clusters with a self-developed Python logistic regression model
Clustered thousands of Twitter hashtags using K-means and TwHIN-BERT LLM embeddings
Developed RNN, LSTM, and Replay models for performance evaluation on multilingual datasets
Co-authored a paper on social media influence graph mapping, accepted by the IEEE-Trans conference