Data Scientist

Location:
Lincolnshire, IL
Posted:
October 09, 2020

Resume:

XIN GU

North Chicago Area, IL 617-***-**** adgtpd@r.postjobfree.com

www.linkedin.com/in/xingudatascience/ github.com/xgu1-ds?tab=repositories US Citizen

EDUCATION & CERTIFICATION

INDIANA UNIVERSITY, Bloomington, IN

Master of Science (MS) in Data Science Anticipated Dec. 2020

Coursework: Introduction to Statistics, Applied Machine Learning, Applied Data Science, Big Data Management, Applied Statistics, Engineering Cloud Computing, Introduction to NLP for Data Science, Machine Learning with Spark, Basics of Scala, Deep Learning, Data Science in Practice

EAST CHINA UNIVERSITY OF SCIENCE AND TECHNOLOGY, Shanghai, China / THE MAX PLANCK INSTITUTE FOR COAL RESEARCH, Mülheim an der Ruhr, Germany

Doctor of Philosophy (PhD) in Chemistry Engineering 2011

EAST CHINA UNIVERSITY OF SCIENCE AND TECHNOLOGY, Shanghai, China

Bachelor of Science (BS) in Applied Chemistry 2006

SAS GLOBAL CERTIFICATION PROGRAM

SAS Certified Professional: Advanced Programming Using SAS 9.4 2020

SKILLS

Machine learning and data mining: classification, regression, clustering, anomaly detection, neural networks

Statistics: exploratory data analysis, statistical inference, hypothesis testing, nonparametric statistics, regression models

Programming and data analytics tools: Python, R, SAS, SQL Database, R Shiny, Tableau

Python Libraries: Pandas, NumPy, scikit-learn, NLP, SciPy, Gensim, Keras, PyTorch, Matplotlib, Plotly

Big Data and cloud computing: AWS

EXPERIENCE

Research Assistant, School of Chemistry and Molecular Engineering, ECUST, Shanghai, China Sep 2008 – June 2009

• Oversaw maintenance, operation and data interpretation of experimental instruments.

• Taught small groups of undergraduates about instrument operation.

• Mentored two junior PhD students, helping them explore research areas, set goals, and identify resources.

PROJECTS

• EYEX (Contributor, Fall 2020)

o Create a deep learning algorithm for the rapid detection and diagnosis of COVID-19 through automated X-ray analysis.

o Skills and technologies: Computer Vision, Transfer Learning, Data Augmentation

• Cloudmesh Volume Management (Contributor, Spring 2020)

o A simple abstraction layer to manage Cloud Volumes for AWS, Azure, Google, OpenStack, Oracle and Multipass.

o Skills and techniques: Python, Command Line Interface, Cloud Computing

• COVID-19 Open Research Dataset Challenge (CORD-19) (Contributor, Spring 2020)

o Executed text mining and topic modeling on coronavirus literature as part of a Kaggle competition.

o Skills and techniques: ETL, NLTK, spaCy, LDA, GloVe

• Home Credit Default Risk (HCDR) (Contributor, Fall 2019)

o Predicted whether a client will repay a loan based on home credit default risk (HCDR) as part of the Kaggle Competition.

o Skills and techniques: EDA, Metrics, Baseline, Feature Engineering, Feature Selection, Hyperparameter Tuning, Machine Learning, Logistic Regression Models, PCA

• Capital Bikeshare Data Management and Analysis (Contributor, Fall 2019)

o Implemented a big data life cycle pipeline covering data ingestion, storage, and analysis.

o Skills and techniques: ETL, MongoDB, Hadoop, MapReduce, EDA, Feature Engineering, Random Forest


