Xiaomei Song
Albany, CA ***** 510-***-**** ad10nr@r.postjobfree.com LinkedIn Github
EDUCATION
UNIVERSITY OF CALIFORNIA, BERKELEY
COMPUTER SCIENCE B.A (DATA SCIENCE) GPA 3.87
EXPERIENCE
Lead Database Architect, EpiNu Nutrition Security Engine Sep 2023-Dec 2023 UC Berkeley Data Science Discovery Research Program Berkeley, CA
● Orchestrated the design and optimization of a scalable database system for EpiNu, boosting data efficiency by 30%
● Implemented a comprehensive relational database model using ER diagrams and normalization techniques Data Science Modules Developer Aug 2023-Present
UC Berkeley's College of Computing, Data Science and Society Berkeley, CA
● Designed a NLP model, and utilized predictive modeling, generalization and text mining to analyze unstructured legal text
● Analyzed police department stop data using clustering, decision trees, random forests, causal inference, and supervised learning
● Created Python script using Smith-Waterman algorithm for DNA sequencing in bioinformatics research Data Science Consultant UTech Aug 2023-Present
UC Berkeley D-Lab Berkeley, CA
● Delivered Python-based workshops on machine learning and data transformation
● Provided real-time assistance and expert consulting, improving client satisfaction Data Management System Project Aug 2022-Dec 2022
UC Berkeley Data Management CS186 Course Berkeley, CA
● Led team of three in creating open-source RDBMS with distributed relational databases, data warehousing, architecture, modeling, data flow, mining, indexing, query optimization, and transactions
● Implemented data storage with multi-granularity data lockers for confidentiality, data integrity, security, and privacy controls AI and Machine Learning Project Pac-Man AI Game January 2022 - May 2022 UC Berkeley AI CS188 Course Berkeley, CA
● Managed data structures, applied statistical methods for data preparation, cleaning, normalization, handling missing data, outlier detection, and feature scaling.
● Utilized supervised and unsupervised learning algorithms for data analysis, applying statistical techniques, evaluating models using performance metrics, and hyperparameter tuning.
● Implemented Q-Learning & Function Approximation for autonomous Pac-Man using Deep & Reinforcement Learning. Data Pipeline Analyst & Business Analyst Feb 2017 - Aug 2022 Good Farmer LLC Albany, CA
● Led diverse cross-functional teams in analyzing, planning, and executing strategic initiatives aligning with organizational goals
● Analyzed data to discern market trends, foster product innovation and refine business strategies, securing 50 e-commerce clients
● Founded a global import-export business, prioritizing streamlined supply chain operations and customer value management Data Scientist & Analyst Aug 2009 - July 2015
Beijing Tomorrow Company Beijing, China
● 6+ years experience in data-focused analytics for the finance industry, specializing in statistical inference, provided project development, delivery, ongoing customer analytics support across banking, insurance, stock, and futures markets
● Reduced risk by 25% through quantitative analysis and prediction using commercial bank big data, employing data analytics, visualization methods, and business intelligence tools for decision-making
● Utilized performance metrics, KPIs, and stakeholder management to measure risk reduction effectiveness
● Aligned on analytics and business requirements with stakeholders, informing strategic goals with data-driven and insights LEADERSHIP
Mentor, Pioneers In Engineering, Berkeley, CA Aug 2022 - May 2023
● Led a group of 6 students in designing and building a remote-controlled robot for competition Founder and volunteer, Sweet Bee Community Support, Albany, CA March 2020 - Sep 2021
● Initiated and oversaw PPE (Personal Protective Equipment) production, orchestrated donation events, and championed engagement of 200+ volunteers, 50+ clinicians, and 5+ partner organizations, resulting in $12,000 raised SKILL
● Programming Languages: Python, Java, SQL, C, C++, Javascript, Scheme, HTML
● Database: MongoDB, MySQL, PostgreSQL, NoSQL, Hadoop, Spark, AWS
● Libraries: NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, PyTorch, Masterpom, Stdlib, Xchart; Frameworks: TensorFlow;
● Software Development: Restful API design, Object-Oriented Programming, System Design, ERP, UI, CI/CD
● Tools: IDEs, Git, GitHub, Excel, Jupyter Notebook, Agile, Project Management, Technical Communication, Problem Solving