Post Job Free
Sign in

Data Science

Location:
Arvada, CO
Posted:
August 06, 2025

Contact this candidate

Resume:

Joshua Felce

+1-303-***-****

***********@*****.***

https://www.linkedin.com/in/joshuafelce/

https://github.com/JF400

Skills

• Python (OOP, Pandas, NumPy, Matplotlib,

Seaborn)

• SQL (MySQL, PostgreSQL, SQL Server)

• Apache Spark (PySpark, Spark DataFrames,

MLlib, Spark SQL, RDD, Streaming)

• Git/GitHub/GitHub Desktop

• Deep Learning (Keras, TensorFlow)

• Machine Learning (Scikit-Learn)

• Statistics (ANOVA, T-tests, PCA, Bayesian Inference)

• Excel

• Calculus/Linear Algebra (Matrices, Gradient Descent)

• English (native), Spanish (native)

Education

MASTER OF SCIENCE IN DATA SCIENCE – University of Denver, CO, USA – GPA: 4.0/4.0 2023 – 2025 Courses: Database Organization & Management, Parallel & Distributed Computing, Python Software Development, Deep Learning, Machine Learning, Probability and Statistics 1 & 2, Algorithms for Data Science, Data Visualization BACHELOR OF SCIENCE IN MATHEMATICS AND PHYSICS – University of Calgary, AB, Canada – GPA: 3.5/4.0 2018 – 2023 M.S., Data Science Projects

RETINAL SCAN MULTI DISEASE CLASSIFICATION April 2025 – June 2025

● The dataset consisted of 3,544 retinal scan images, each annotated for the presence of 7 eye diseases. A retinal scan may be associated with multiple diseases simultaneously.

● The task was to build a multi-label classifier that identifies the specific eye diseases present in a scan. Two different approaches were employed to achieve this task: The binary relevance method from the scikit-multilearn library and a single convolutional neural network designed for multi-label classification.

● The first technique decomposes the multi-label task into 7 independent binary classification problems—training a separate convolutional neural network (deep learning) and a separate support vector machine or random forest (machine learning) for each disease. F1 threshold maximization was employed for this technique.

● The second technique involved a single convolutional neural network with 7 neurons in the output layer—one for each disease— that were activated by the sigmoid function to produce independent probabilities for each disease.

● The project showcases the use of Python, Deep Learning (Keras, TensorFlow), and Machine Learning (Scikit-Learn). VIDEO GAME DATABASE November 2024 – December 2024

● Designed a video game database model: A Star Schema consisting of a central fact table surrounded by multiple dimension tables.

● The data being modeled consisted of the name of the video game, release year, platform, publisher, genre, and sales numbers in various regions of the world.

● Created 4 stored procedures using MySQL to answer common queries on the dataset. BOOK DATABASE October 2024 – November 2024

● Created a Graphical User Interface (GUI) using Python and SQL where the Python library Tkinter was used to create the GUI and connected to a MySQL database using the Python Connector Interface.

● Users can interact with the GUI to search for books by author, category, or publisher. Work Experience

UNDERGRADUATE STUDENT RESEARCHER – University of Calgary April 2021 – September 2021

● Awarded a grant to conduct and then write a report on a research project done within the Department of Mathematics.

● Used Python to implement two numerical methods to solve a Partial Differential Equation: Centered Finite Difference Method and a Physics Informed Neural Network (built using Keras and TensorFlow). Work Authorization

Authorized to work in the U.S. under OPT (36 months); TN visa available for continued employment (Canadian Citizen).



Contact this candidate