Jacky Xue
315-***-**** ac2f8t@r.postjobfree.com San Mateo, CA https://www.linkedin.com/in/jacky-x-49188113a/
SUMMARY
Excellent data analysis ability equipped with strong programming skills and advanced statistical knowledge. Skilled in data mining/exploring and resolving issues with quantitative/modeling methods.Passionate about explaining technical concepts to non-tech audience.
SKILLS
●Programming: Python, R, SAS, Matlab.
●Database: MySQL, NoSQL, Teradata, MongoDB
●Data visualization and Front end: Spotfire, SAS Visual Analytics, Tableau, JavaScript, CSS, Html.
●Statistics: Regressions, Classification, Hypothesis (A/B) testing, ANOVA, Machine Learning, Optimization, Survival Analysis, Monte Carlo Simulation, Experimental Design
EXPERIENCE
Data Scientist Fellow Springboard 06/2017-Current
●Kaggle project quora question pairs. Ranked in top 8% on Kaggle leaderboard and won the bronze medal.
Extracted 60+ NLP features such as tf-idf, euclidean distance, cosine similarity, word2vec embedded features.
Used multiple machine learning algorithms (xgboost/random forest/logistic regression/neural network deep learning etc.) and ensample skills(stacking/bagging) to build models and make predictions.
Senior Statistical Programmer Gilead Science 01/2013-Current
●Building tools to aid in business decisions - eg. Designed end-to-end data visualization tool which was then used as the primary EDA tool by cross-functional teams in Biometrics department of 500+ users.
●Formulating clinical questions into analytic projects and performing statistical models to solve the questions . Excellent data sense in EDA(exploratory data analysis), creating compelling visualizations, and drawing conclusions from different statistical modeling techniques.
●Leading the team to set the data infrastructure, develop sample codes and macros, build database to support the visualization tool.
Statistical Consultant PharmaNet/i3 06/2011-01/2013
●Built and maintained a biometrics database to support the company wise analysis work.
●Used different statistical packages to perform analysis on clinical data for drug safety and efficacy analysis and submit the results to FDA.
SAS Programmer Amylin Pharmaceutical 06/2009-06/2011
●Wrote code to perform statistical analysis and tests on clinical trial data using SAS or R.
EDUCATION
Syracuse University, Master in Economics (Tutored on Econometrics) Syracuse, NY 2006-2008
SWUFE, Bachelor in International Economics and Trade Chengdu, China 2002-2006