Post Job Free
Sign in

Data Science Computer

Location:
Berkeley, CA, 94702
Posted:
February 14, 2024

Contact this candidate

Resume:

Sophie Cui

Berkeley, California 510-***-**** *********@********.***

Motivated and analytical Computer Science and Mathematics student with experience in programming, data engineering and analysis, consulting, and research. Skilled in Python, Java, and SQL, and proficient in using Tableau, Git, Jupyter, and Microsoft Office. Seeking a challenging role to apply my skills and contribute to innovative projects. EDUCATION

University of California, Berkeley Expected Graduation: May 2025 BA Computer Science; BA Applied Mathematics GPA: 3.55/4.0 Relevant Coursework: Introduction to Artificial Intelligence, Machine Structures, Data Structures, Structure and Interpretation of Computer Programs, Principles and Techniques of Data Science, Foundations of Data Science, Introduction to Complex Analysis, Numerical Analysis, Introduction to Abstract Algebra, Linear Algebra, Introduction to Analysis, Introduction to Mathematical Economics, Concepts of Probability PROFESSIONAL EXPERIENCE

Kaiser Permanente Information Technology June 2023 – August 2023 Platform Engineering Intern Pleasanton, California

● Created modernized data visualization feature for Kaiser’s digital notification platform using Tableau that enabled all internal teams to view and access data from the ~3 million notifications generated by the platform on a daily basis.

● Built pipeline using ADLS Gen2 and CosmosDB to automate the previously manual generation operational reports. Blindspot Solutions June 2022 – August 2022

Machine Learning Development Intern Prague, Czech Republic

● Worked with network performance data and anomaly detection models like ARIMA, DeepAnT, and ADTK.

● Determined the best model through comparing and analyzing their compatibility and effectiveness on project data. PROJECTS AND RESEARCH

Big Data at Berkeley August 2022 – Present

Data Consultant for Valuenex Berkeley, California

● Scraped data on 65,044 companies from Twitter to use as features in building a multivariate gaussian model that predicts the potential success of a startup for investment purposes using Valuenex’s Startup Finder.

● Included web-scraped results from TechCrunch with additional information on companies into the Startup Finder. Data Consultant for Ulta Beauty

● Converted NoSQL customer transaction data into SQL data and categorized products into 10 groups.

● Used RFM Analysis on product categories and the tenure of a customer to represent each customer with 31 features.

● Experimented with different clustering and dimensionality reduction techniques to optimize silhouette score. Berkeley Data Science Discovery Project August 2022 – December 2022 Research Assistant for Applied Materials Berkeley, California

● Processed anonymized data corresponding to 216 of Applied Materials’ semiconductor wafers at 2600+ time points.

● Developed linear regression models with >85% accuracy that predicted processing chamber conditions and wafer characteristics given a set of input conditions in order to maintain ideal product characteristics. Texas State University Summers 2020 – 2021

Research Assistant San Marcos, Texas

● Developed network-based mathematical approaches for predicting spread of infectious diseases by tracing the spread of the coronavirus in Jilin, China using graph theory and time series analysis under Professor Alex White.

● Analyzed sentiment trends in promotional networks using graph theory under Professor Jelena Tesic. SKILLS

Python (Pandas, Numpy, scikit-learn, matplotlib), Java, SQL, Tableau, Git, Jupyter, Microsoft Office, Mandarin Chinese



Contact this candidate