Chi (David) Yu
San Francisco, CA ***** 415-***-**** *******@*****.*** https://www.linkedin.com/in/chi-yu/ PROFESSIONAL EXPERIENCE
Megalabs.ai, Palo Alto CA — Data Analyst/Scientist Intern September 2024 - Present
● Built interactive Tableau dashboards to visualize AI/ML model performance and benchmark analyses.
● Curated and optimized large scale multimodal datasets with python for training and evaluation.
● Used SQL and Python (Pandas, NumPy, scikit-learn) to conduct exploratory data analysis (EDA) and generate insights.
● Automated Python scripts for data validation and anomaly detection, improving data integrity. EZ Texting, San Francisco CA — Data Analyst Engineer September 2022 - August 2024
● Developed SQL queries and stored procedures to optimize db performance and reporting workflows.
● Designed and automated Tableau dashboards for key business and customer insights.
● Migrated MySQL databases to BigQuery, improving data reliability and scalability.
● Used Python (Pandas, Matplotlib, Seaborn) for trend analysis and reporting automation. TEKsystems Inc., Menlo Park CA — Data Analyst Engineer @ Meta February 2021 - August 2022
● Developed Python-based data pipelines to extract, transform, and load (ETL) at scale datasets.
● Developed Unidash dashboards for business intelligence reporting and stakeholder analysis.
● Conducted data validation and quality checks, ensuring consistency across Dataswarm, Presto, and CDM.
Upped Events Inc., Philadelphia PA — Data Analyst/Scientist Intern October 2020 - January 2021
● Developed Python scripts for data preprocessing and feature engineering to enhance models.
● Designed SQL-based event analytics dashboards to track user engagement and campaigns.
● Conducted data preprocessing and feature engineering to support business insights. Blast Inc., San Francisco CA — Data Analyst Scientist Intern April 2020 - August 2020
● Created SQL-based reports and visualizations for customer retention analysis.
● Developed and monitored KPI dashboards to track engagement trends.
● Built and optimized Python scripts for ETL workflows, improving data pipeline efficiency. SKILLS
● Programming & Query Languages: Python (Pandas, NumPy, scikit-learn), SQL (PostgreSQL, MySQL, Presto)
● Business Intelligence & Visualization: Unidash, Tableau, Looker, Dashboard Storytelling
● Database Management: BigQuery, MySQL, Presto, Dataswarm, CDM, dbt, Airflow, Spark
● Machine Learning (for Data Analysis): Transfer Learning, LSTMs, RNNs EDUCATION
Galvanize Inc., San Francisco, CA — Data Science Immersive Certificate November 2019 - March 2020
Columbia University, New York — M.S. & M.A. in Mathematics Education September 2005 - May 2009
New Jersey Institute of Technology, Newark — B.S. in Applied Mathematics and Mechanical Engineering
September 1999 - May 2005