Sophie Cui
Berkeley, California 510-***-**** *********@********.***
Motivated and analytical Computer Science and Mathematics student with experience in programming, data engineering and analysis, consulting, and research. Skilled in Python, Java, and SQL, and proficient in using Tableau, Git, Jupyter, and Microsoft Office. Seeking a challenging role to apply my skills and contribute to innovative projects. EDUCATION
University of California, Berkeley Expected Graduation: May 2025 BA Computer Science; BA Applied Mathematics GPA: 3.55/4.0 Relevant Coursework: Introduction to Artificial Intelligence, Machine Structures, Data Structures, Structure and Interpretation of Computer Programs, Principles and Techniques of Data Science, Foundations of Data Science, Introduction to Complex Analysis, Numerical Analysis, Introduction to Abstract Algebra, Linear Algebra, Introduction to Analysis, Introduction to Mathematical Economics, Concepts of Probability PROFESSIONAL EXPERIENCE
Kaiser Permanente Information Technology June 2023 – August 2023 Platform Engineering Intern Pleasanton, California
● Created modernized data visualization feature for Kaiser’s digital notification platform using Tableau that enabled all internal teams to view and access data from the ~3 million notifications generated by the platform on a daily basis.
● Built pipeline using ADLS Gen2 and CosmosDB to automate the previously manual generation operational reports. Blindspot Solutions June 2022 – August 2022
Machine Learning Development Intern Prague, Czech Republic
● Worked with network performance data and anomaly detection models like ARIMA, DeepAnT, and ADTK.
● Determined the best model through comparing and analyzing their compatibility and effectiveness on project data. PROJECTS AND RESEARCH
Big Data at Berkeley August 2022 – Present
Data Consultant for Valuenex Berkeley, California
● Scraped data on 65,044 companies from Twitter to use as features in building a multivariate gaussian model that predicts the potential success of a startup for investment purposes using Valuenex’s Startup Finder.
● Included web-scraped results from TechCrunch with additional information on companies into the Startup Finder. Data Consultant for Ulta Beauty
● Converted NoSQL customer transaction data into SQL data and categorized products into 10 groups.
● Used RFM Analysis on product categories and the tenure of a customer to represent each customer with 31 features.
● Experimented with different clustering and dimensionality reduction techniques to optimize silhouette score. Berkeley Data Science Discovery Project August 2022 – December 2022 Research Assistant for Applied Materials Berkeley, California
● Processed anonymized data corresponding to 216 of Applied Materials’ semiconductor wafers at 2600+ time points.
● Developed linear regression models with >85% accuracy that predicted processing chamber conditions and wafer characteristics given a set of input conditions in order to maintain ideal product characteristics. Texas State University Summers 2020 – 2021
Research Assistant San Marcos, Texas
● Developed network-based mathematical approaches for predicting spread of infectious diseases by tracing the spread of the coronavirus in Jilin, China using graph theory and time series analysis under Professor Alex White.
● Analyzed sentiment trends in promotional networks using graph theory under Professor Jelena Tesic. SKILLS
Python (Pandas, Numpy, scikit-learn, matplotlib), Java, SQL, Tableau, Git, Jupyter, Microsoft Office, Mandarin Chinese