Sign in

Python, SQL, AWS, Machine Learning, NLP

Hoboken, NJ
February 28, 2020

Contact this candidate


Yu-Lin Shen

Jersey City, NJ +1-929-***-****


New York University, New York, NY (GPA: 3.22 / 4) Expected 05/2021 M.S. Computer Engineering with Merit Scholarship: $8000 per year

Relevant Courses: Data Structure & Algorithm, Database System (Oracle), Computer System Architecture (C++) National Taiwan University, Taipei, Taiwan 07/2017 B.S. Mathematics (GPA: 3.16 / 4.3)

Leaderships & Activities: President of the Baking Club and Human Resource Officer in the Art Festival Professional Experience

Data Engineer LnData Technology Co., Ltd, Taipei, Taiwan 09/2017 - 06/2019 Customer Value Platform - L'Oréal Full Stack Developer (Python, MySQL, MongoDB, AWS) Description: A SaaS, integrating user data on social media with L'Oréal CRM via mission assignment with rewards

Designed a serverless system structure under AWS components (EC2, lambda, API Gateway, SQS, CloudWatch) with the joint tables on MongoDB and MySQL

Established APIs via Tornado and auto web parsers under CentOS with Git version control with SourceTree LnSocial (Python, Git, Docker)

Description: A SaaS for social listening

Built scalable batch web crawlers (requests, scrapy, selenium) on Docker Container processed version control on Git with back-up indexing (Elasticsearch) for data consistent

Complemented the existed service with new function as clients-emailing, events-alerting, auto-reporting

Implemented self-designed algorithm for parallelizing crawler mission distribution

Developed management mechanism for business team (non-coders) to real-time audit over ten of thousands’ social media fanpages through concatenating Excel, MySQL, and instant crawler Customer Interest Tagging Model Model Developer (Python, Keras, TensorFlow, NLP) Description: integrated online text data and web cookies behavior to tag interest on end-user

Achieved 91% accuracy on categorizing 20 industrial interests via K-means clustering, Random Forest in scikit- learn and LSTM in Keras and TensorFlow

Defined the weights of updating Mandarin nouns by using TF-IDF, Bag-of-Words methodologies and decomposed Mandarin common statement from 20+ industries via N-gram for vocabulary dictionaries

Processed gigabyte data sets of user cookies, IPs, and mobile device IDs for winnowing out invalid traffic from anomalous sources and frequency

Supported prototype initiation and successfully gained 20000+ USD government funding to win various clients, including Dyson, and L'Oréal

Opinion Leader Related Network-Based Trading System Method & Storage Medium Patent Builder Description: A system to match social media influencers to different industries

Defined a set of main indicators as the core model with three layers to evaluate users’ performance (KOL) on social media platforms with statistical and mathematical approaches Project Researcher Prof. Kung’s Lab., National Taiwan University 08/2018 - 11/2018 Topic: Harbor pilots’ working time prediction in the Port of Kaohsiung (the largest harbor in Taiwan)

Optimized the original model in 30% lower MAE of working time prediction through scikit-learn and Keras Intern LnData Technology Co., Ltd, Taipei, Taiwan 03/2017 - 06/2017

Organized individual online valid cookies, and offline demographic data for brands’ CRM panels

Evaluated the trending brand events on social media and websites in the annual report for Tiger Beer and Heineken

Selected Projects

User Login Credential and Authorization (Python, Flask-RESTful, SQLAlchemy, Postman, Heroku)

Practiced the Flask-RESTful on user login procedure with JWT token, manipulated the SQLAlchemy as the database connection and deployed the Flask API on Heroku Skills

Programming Languages: Python, SQL, C++, R

Tools: AWS, MySQL, MongoDB, Elasticsearch, Git, Docker, Linux, Microsoft Advance Excel, SAS EG Interests: Technologies, Modern Art, skateboarding, Traveling, Baking

Contact this candidate