NATHAN CHOO
650-***-**** ******@*****.*** San Francisco, CA
PROFESSIONAL PROFILE
A former bioinformatician with the skills and passion for solving problems using data science. A creative, analytical, and dependable professional looking to leverage my skillset and knowledge to help answer data driven questions and achieve actionable results. I enjoy learning and exploring new technologies.
CORE QUALIFICATIONS
Critical Thinking: Able to critically think at a high level, identify opportunities in large rich data sets, and hypothesize interesting questions.
Modeling: Able to design and implement statistical & predictive models on a wide variety of data types. Experience with data mining, ETL workflows, and feature engineering for modeling. Analytics: Utilize statistical tools to analyze trends and relationships between different pieces of data and draw appropriate conclusions. Develop visualization tools and reports that can help end users access and analyze results. Communications: Able to clearly communicate data insights and tell stories through visualizations. Excel at collaborating across different teams to accomplish goals.
SKILLSET
• Programming: Python, R, Perl, BASH
• Web: HTML5, CSS3, Javascript, JSON
• Frameworks: R Shiny, Django
• Databases: SQL, postgreSQL, MongoDB, SQLite
• Data Science: Pandas, scikit-learn
• Machine Learning: KNN, K-means, Clustering, NLP, Decision Trees, Linear and Logistic Regression, PCA, SVM, Map Reduce, Dimensionality reduction, Naïve Bayes, Text Mining, A/B Testing
• Visualization: Seaborn, ggplot, Matplotlib, Google Charts
• Systems: Linux, Windows, OSX, AWS (EC2 & S3)
PROJECTS
• Predicting genotoxicity risks due to titanium dental implants using support vector machines
• Building an HIV Integrase Inhibitor prediction model utilizing support vector machines in Python
• Building an NBA database and predicting game winning outcomes using machine learning techniques
• Metagenomic data analysis of wastewater using Next- Generation sequencing applications
• Analyzing the effect of drugs on patient’s blood pressures using R
EXPERIENCE
Bioinformatics Engineer
Celgene, San Francisco, CA
June 2014 – June 2016
• Designed and built web applications (Full
Stack) for sequencing analysis
• Designed and implemented databases for
a laboratory informatics management
systems
• Created bioinformatics tools for analyses of
cancer drug candidates
• Support single cell and bulk sequencing
projects utilizing Amazon S3 and EC2
services
Bioinformatics Intern
Genomic Health, Redwood City, CA
June 2013 – Sept. 2013
• Led research for the discovery of breast
cancer biomarkers using copy number
variation analysis techniques
• Developed software pipelines in Galaxy for
detecting CNVs within breast cancer cell
lines
Automation Associate II
XDx,
Brisbane, CA
April 2010 – January 2012
• Provided systems and assay support for
the FDA approved molecular diagnostics
test Allomap
• Analyzed validation data JMP
Systems Engineer
Roche Molecular Diagnostics,
Pleasanton,
CA July 2008 – April 2010
• Programmed and validated molecular
diagnostics assay systems and software for
FDA approval
R&D Engineer
Penumbra Inc.,
San
Leandro,
CA
June 2007– September 2007
• Concept designed and prototyped a new
generation balloon guide Neurocatheter
EDUCATION
B.S. Biological Systems Engineering, 2007
University of California Davis
M.S. Bioinformatics Engineering, 2014
San Jose State University
Data Science, General Assembly, 2014
San Francisco, CA