
Data Scientist

Montclair, NJ
February 29, 2020



Linhan Yang

217-***-**** Kearny, NJ LinkedIn: in/linhan-yang


● Programming: Python (proficient), R (proficient), SQL (experienced), C++ (experienced), SAS (experienced), TensorFlow, NumPy, Scikit-learn, Pandas, SciPy, Matplotlib, Git, PySpark, Keras, Natural Language Processing (NLP)

● Application: Proficient in MATLAB, PyCharm, Adobe Photoshop, Adobe Lightroom

● Certification: Machine Learning Course -- Coursera

WORK EXPERIENCE

Data Scientist Fellow San Francisco, USA

Techlent, Inc. 11/2019-Present

Project: A customized housing/apartment rental recommendation engine trained specifically for the New York City area

● Built a Python web crawler to scrape posted housing/apartment rental information using BeautifulSoup and Selenium

● Processed housing/apartment description data using NLTK, Gensim, and other customized functions. Evaluated and refined feature data with pair plots and polynomial feature selection

● Built and trained supervised regression models to recommend rental prices, using decision-tree-based models (Random Forest, XGBoost) with Scikit-learn and deep learning models (CNN, RNN) with Keras and TensorFlow

● Wrapped the model together with the processing and feature-engineering functions as an API using Flask and served it on Google Cloud to facilitate internal use


Global Population Changes Analysis with Google Earth Urbana, IL, USA

Project Lead 09/2018-12/2018

● Created a web crawler to collect population data; organized and refined the captured data in a KML file structure

● Visualized the results by creating Google Earth-based files to give policymakers an intuitive view of population growth, health, mortality, and other population characteristics

Crime Rate Analysis and Interaction within Different Crimes Urbana, IL, USA

Project Lead 09/2018-12/2018

● Used univariate analysis and ANOVA tests to obtain descriptive statistics, frequency tables, and significance levels, which helped evaluate and clean the datasets

● Interpreted over 90% of the total crime rate across different crimes and states using covariance-based Principal Component Analysis (PCA) to help local governments establish a better precautionary system

● Predicted the relationship between felonies and other crimes with Discriminant Analysis to help state governments achieve better crime rate control

Sustainable Energy Development Modeling – Food Waste to Energy Champaign, IL, USA

Project Lead 02/2018-08/2018

● Coordinated tasks with 4 other team members and collaborated with the directors of a different department to collect food waste data and map boundary information

● Selected and refined data from over 30,000 building data records using conditional filters in Spark and reorganized it into easy-to-use types

● Built a decision-tree-based model predicting over $9M in estimated annual cost savings, providing theoretical support for the University’s 2020 carbon reduction target

EDUCATION

Master of Science in Environmental Engineering Urbana-Champaign, USA

University of Illinois at Urbana-Champaign 05/2019

Bachelor of Science in Chemical Engineering and Technology Shanghai, China

East China University of Science and Technology 08/2017

Bachelor of Science in Environmental Engineering Lübeck, Germany

Lübeck University of Applied Sciences 08/2017
