Post Job Free
Sign in

Data Analyst Class B

Location:
Chicago, IL
Posted:
January 13, 2023

Contact this candidate

Resume:

MATTHEW TOMBERS

Phone: 708-***-**** Email: ********@*****.*** LinkedIn: m_tombers GitHub: m_tombers

SUMMARY

Data Analyst technology enthusiast with a Bachelor’s degree in Computational Physics - Mathematics. Recently completed a Data Science fellowship; growing skills in data analysis, statistical modeling, Python, and SQL.

Leveraging experience in technical communication and a passion for metrics, performance, and tuning; I aim to land a role working in Data Analytics - Machine Learning - Software Development.

TECHNICAL SKILLS

Languages: Python (pandas, NumPy, Seaborn, Matplotlib, scikit-learn)

Machine Learning: Linear Regression, Logistic Regression, Decision Tree, RandomForest, Xgboost, Support Vector Machine, K-Means

Statistical Methods: Regression, Classification, Clustering, Statistical learning, Hypotheses testing

Predictive Modeling: Google Cloud Platform

Database Management: SQL

Data Visualization: Tableau

PROJECTS

Strawberry Classification Model

●Created an image classification model for strawberries using Google Cloud Platform.

●Labeled 400 images of strawberries class A (clear images - less leaves) or class B (blurred images - more leaves) and trained a classification model to predict the class given an unlabeled image.

●Model metrics: accuracy = 0.91, precision = 0.84, recall = 1

●Tools: Kaggle, Python (pandas, NumPy, Matplotlib), GCP Auto ML computer vision

Automated farming vs. Traditional methods

●Conducted an efficiency analysis to determine if automated Controlled Environment Agriculture (CEA) is more efficient than traditional farming methods.

●Using data from AutonomousGreenhouseChallenge2019 - Compared automated CEA vs. current commercial production methods of cherry tomatoes.

●Automated CEA consumed: 1.1% more electricity, 5.6% more CO2, 5.9% more water, 87.1% less heat, and produced 5.1% more fruit compared to the current commercial method.

●Tools: CEA Open Data, Python (pandas, NumPy, Matplotlib)

EXPERIENCE

Kaggle competition House Prices - Advanced Regression Techniques 2022

●Create a pricing model using provided housing data set

●Random Forest Regressor model Mean Absolute Error: 21,218

Kaggle competition Titanic - Machine Learning from Disaster 2021

●Build a model to predict the fate of passengers on the Titanic

●Random Forest Classifier model : 77.5% accurate

Data Science Fellow Springboard 2020 - 2021

●Data Analytics

●Machine Learning

●Python, SQL, statistics

EDUCATION

Illinois State University Normal, IL

Bachelor of Science in Computational Physics Mathematics minor

CERTIFICATIONS

Data Science Career Track Springboard 2021



Contact this candidate