Scarsdale, NY
adjknc@r.postjobfree.com
https://github.com/sun-gif
SKILLS
●Programming languages: Python*, Pytorch, SQL*
●Data Science tools: pandas, numpy, scipy, matplotlib, scikit-learn
●Machine learning: random forest, logistic regression, k-mean clustering, reinforcement Q-learning, deep learning, feature engineering, principal component analysis
●Statistical method: hypothesis testing
●Applicable software: Sql server*, Spark, cognos
EXPERIENCE
Springboard, 2020 Jan-present
P1) Medical image segmentation for Prostate cancer samples using pytorch
●Developed a pixel-wise mask of each image object to segmentation of PCa structures using U-net Semantic segmentation technique
●Create customized dataset function to load image and mask and mapping and transform them to tensors
● Evaluated the result by Soft Dice Coefficient and NLLLOSS P2) Multi Classification model of Human activity prediction for smartphone data sets
●Developed multi classification model using random forest and LSMT to predict human activity based on smartphone signal using python
●Performance Metrics: Accuracy_score,Confusion Matrix,Roc_auc_score
●Parameters Tuning : Grid search and k-fold cross-validation strategy P3) Train a Smartcab to Drive
●Created Reinforcement Learning to teach an agent learn from its past experience
●Implement and improved a Q-Learning Driving Agent Mediq, Data Scientist, Westerchester, NY 2009 - 2016
●Design and develop machine learning(RF,LG,SVM and k-Mean) solutions to solve diverse business challenges using python
●Gather,Mine and analyze data to perform statistical analysis, identify key factors and build comprehensive visualizations to report findings
●Communicate results of analyses to business partners and executives
●Working on relational databases, including SQL, and large-scale distributed systems EDUCATION
New York University Tandon School of Engineer, Phd in BioChemistry, 2008 Shanghai Jiaotong University, shanghai, China, 2003