Sign in

Social Media Data

Fairfield, California, United States
December 12, 2018

Contact this candidate


Leon Rafael Gutierrez Angulo (650) 290-0580_ EDUCATION University of California, Berkeley

Applied Mathematics; emphasis in Operational Research August 2018 Spanish Language and Literature GPA: 3.55/4.0


• Principles and Techniques of Data Science

(Python, SQL, Apache Spark)

• Structure and Interpretation of Computer

Programs (Python, SQL, Scheme)

• Concepts in Computing with Data (R)

• Data Structures (Java)

• Introduction to Database Systems (Java)

• Concepts in Probability


• Languages:

o Python (Libraries: Pandas, NumPy, Matplotlib, Scikit-learn) o R (Packages: ggplot2, ggthemes, shiny, R Markdown, dplyr, tidyr, stringr, XML, DT, boot, class, pls, stat) o Java


• Database Systems: PostgreSQL

• Software: Excel, Power Point

• Others: Apache Spark, Jupyter Notebooks, Scheme

• Machine Learning: Linear regression, logistic regression, cross-validation, LOOCV, Ridge regression, Principal Component Regression (PCR), Partial Least Squares Regression (PLSR), Regression splines JOB EXPERIENCE FLX Bio, South San Francisco, CA

Computational biology intern, October 2018 – present

• Polish and update their web-based data mining tools to provide the best data analysis functionality to my FLX colleagues.

• Implement advance data mining on some of their immune oncology data. ACADEMIC PROJECTS Priority queue (Java), Data Structures UC Berkeley Spring 2018

• Designed a priority queue using a binary min-heap. Puzzle (Java), Data Structures UC Berkeley Spring 2018

• Developed an artificial intelligence that implemented the A* search algorithm to solve puzzles, also known as ‘best first search’.

Trump, Twitter, and Text (Python), Principles and Techniques of Data Science, UC Berkeley Fall 2017

• Analyzed actual data from the Twitter API. We used tweepy to download the tweet data into my server, so that I could convert it into a Pandas dataframe for further analysis.

• Found out that that many of the tweets were not from Trump himself, but from his staff. There is a big among of tweets from iPhone before his inauguration on January 20th, but according to the White House director of social media Dan Scavino Jr. Trump switched to the Apple device ahead of his inauguration. Spam/Ham Prediction (Python), Principles and Techniques of Data Science, UC Berkeley Fall 2017

• Designed a classifier that could distinguish spam emails from ham emails with an accuracy of 97% using the LogisticRegression classifer from the scikit-learn package. SQL FEC Data, and Small Donors (SQL), Principles and Techniques of Data Science, UC Berkeley Fall 2017

• Explored the Federal Election Commission’s data on the money exchange during the 2016 election using SQL queries.

PERSONAL PROJECTS KKN Classifier (Python),

• Implemented a KKN classifier using two functions, one that calculates the distance between two points, and the other one that uses this distance to find the k neighbors closest to the given point. FOREIGN LANGUAGE SKILLS Fluent in Spanish

Contact this candidate