
Junior Data Scientist R, Python, SQL

Location:
Kings County, NY
Posted:
July 25, 2017


Resume:

Royce Ho

Brooklyn, New York 112**-***-*** **** r ******@*****.***

github.com/royh21k blog.nycdatascience.com/author/royh21k linkedin.com/in/royce-ho/

SUMMARY

Data scientist skilled in data munging, feature engineering, and modeling. A quick learner who adapts to any type of work environment. Great at summarizing details into high-level concepts. Enjoys problem solving.

SKILLS

Data Science: R (dplyr, knitr, tidyr, caret, ggplot2, shiny), Python (scipy, numpy, pandas, scikit-learn, selenium, matplotlib), SQL (MySQL), NoSQL (MongoDB)

Machine Learning: K-Nearest Neighbors, Generalized Linear Models, Decision Trees, Naive Bayes, Support Vector Machines, Principal Component Analysis, Clustering, Natural Language Processing, Association Rules

Working knowledge: Apache Hadoop (MapReduce, Hive, Pig), Amazon Web Services, Apache Spark (SQL, MLlib), Databricks, Time Series Analysis, Neural Networks

EDUCATION

Certificate in Data Science, NYC Data Science Academy, New York, NY 2017

● Shiny Web App: Developed an interactive web app to help users visualize and understand the relationship between eating patterns and general health in the American population.

● Web Scraping: Used Selenium to gather player and team data from the National Basketball Association and analyzed player and team performance in wins and losses and in home and away games. The results can be used to infer which players have the greatest impact on wins or losses.

● Kaggle Machine Learning: Built prediction models to classify interest in apartment listings on RentHop, an apartment search site. Used natural language processing, k-means clustering, and basic image processing to create features for the dataset. Applied and ensembled multiple machine learning algorithms (logistic regression, random forests, and gradient boosting) to make predictions.

● Capstone: Used NBA player and team data to predict player performance in daily fantasy sports (DraftKings). Created new advanced statistics. Combined time series models with gradient boosting models to approximate player performance. Generated suggested lineups using a genetic algorithm to solve the limited salary and position type knapsack optimization problem set by DraftKings. Placed in 78% of the contests joined.

Bachelor’s Degree in Chemistry, Johns Hopkins University, Baltimore, MD 2009 - 2012

● Awarded the Greer Undergraduate Research Award for research in organic and nucleic acid chemistry

● Synthesized and analyzed a library of inhibitors for DNA polymerase β at the Greenberg Group to drive improvements in cancer research

EXPERIENCE

Teacher’s Assistant, E-Math, Brooklyn, NY 2013 - 2017

● Prepared grade school students (grades 1-12) for standardized testing

● Lectured on fundamentals of mathematics (logic, algebra, trigonometry, geometry, statistics, and calculus)

Laboratory Coordinator, Clinilabs, Inc., New York City, NY 2015 - 2016

● Collected, processed and handled biological substances

● Managed and maintained laboratory equipment and inventory

Assistant Budget Analyst, U.S. Attorney's Office – EDNY, Brooklyn, NY 2008 - 2015

● Prepared documentation for generating budget reports and for auditing

● Communicated with vendors to update account statuses

● Managed invoices and vouchers and allocated funds for payment

Volunteer Comfort Specialist, Coney Island Hospital, Brooklyn, New York 2014 - 2015

● Assisted in alleviating patients' physical and mental stress


