Data Scientist

Location: Houston, TX, 77094
Posted: July 04, 2020

Resume:

Stacy Shingleton

Houston TX, ***** 281-***-**** adebz0@r.postjobfree.com GitHub Medium LinkedIn

Data Scientist

Experience in data acquisition and data modeling, statistical analysis, machine learning, deep learning, and NLP. With over five years of industry experience in specialty polymer research and development, I bring strong skills in problem solving and project management, along with proven communication skills with team members and customers.

TECHNICAL SKILLS

Python, SQL, scikit-learn, NumPy, pandas, seaborn, matplotlib, Beautiful Soup (web scraping), Surprise, Machine Learning Models (Classification and Regression), Statistical Analysis, API Calls, NoSQL (MongoDB), Docker, Git, Spark, PySpark, OSEMN, AWS, JSON, CRISP-DM, Data Mining

TECHNICAL PROJECTS

Building a Regression Model - GitHub

Built a regressor to predict lifetime box office gross to assist investors and production companies in budgeting movies

● Scraped three HTML sites using the Python library Beautiful Soup

● Converted scraped data to data frames using JSON and pandas

● Cleaned and converted categorical data to binary data using various encoding methods such as MultiLabelBinarizer

● Completed exploratory data analysis on regression data using pandas and seaborn

● Built and scored a variety of regressors with feature importance plots

Building a Classification Model - GitHub

Built a classifier to predict flight delays for airlines to reduce operational costs and emissions

● Imported and filtered a raw dataset that contained information on 5.8 million flights

● Collected hourly weather data through an API call and joined it onto the flights dataset

● Cleaned and converted categorical data to binary data using various encoding methods such as OneHotEncoder

● Completed exploratory data analysis on classification data using pandas and seaborn

● Built and scored a variety of classifier models with feature importance plots and confusion matrices

Building a Recommendation System - GitHub

Built a recommendation system that provides a list of recommended movies for new users

● Extracted and joined datasets containing user reviews and movie titles

● Cleaned and completed exploratory data analysis using pandas and seaborn

● Built a recommendation system using the Python library Surprise (scikit-surprise), which provides new users with movie recommendations

Hypothesis Testing with SQL - GitHub

Completed hypothesis testing using statistical analysis to provide valuable sales and marketing insights for a grocery provider

● Explored an Entity Relationship Diagram (ERD)

● Created functions to calculate the t-statistic, degrees of freedom, and p-value for a Welch’s t-test

● Wrote numerous SQL queries to obtain the data needed for statistical analysis from the database described by the ERD

● Completed Welch’s t-test on data distributions to either reject or fail to reject various null hypotheses

EMPLOYMENT HISTORY

Senior Technical Associate, Kraton Corporation, Houston, TX 01/2014 - 12/2019

● Designed and built the first pressurized polymerization reactor at the Kraton Innovation Center

● Oversaw a capital project with a team of six colleagues and managed a 100K budget while meeting deadlines

● Worked directly with customers developing novel polymers for new business developments

● Organized and presented annual sitewide OSHA safety trainings to over 50 employees

● Certified Lean Six Sigma Yellow Belt

EDUCATION

Flatiron School, Online 01/2020 - 06/2020

Immersive Data Science Bootcamp program

The University of Texas, Austin, TX 08/2009 - 12/2013

Bachelor of Science in Chemistry


