Po-Yu Yang
Phone 201-***-****
E-mail ***********@*****.***
GitHub github.com/dar1enyang
LinkedIn linkedin.com/in/poyuyang15
OBJECTIVE
To obtain a full-time software developer position

EDUCATION
****-** -
Stevens Institute of Technology, Hoboken, NJ
• Master of Science in Computer Science
2010-09 - 2014-06
National Chiao Tung University, Hsinchu, Taiwan
• Bachelor of Engineering in Civil Engineering

EXPERIENCE
2016-09 - 2017-06
Software Developer
HBH Realty Group, Taipei, Taiwan
• Maintained the backend and frontend of HBH websites and extended existing attendance features
• Designed and developed real estate listing processing applications and accounting systems using Java
• Analyzed associated sales data for delivery to clients and wrote SQL queries against a MySQL database

SIDE PROJECTS
Real Estate Listing Scraper
• Designed a web scraper for realtors using Java and the Jsoup package; it organizes property listing data that can be analyzed to determine sales and prospective buyers
• Stored the housing data in a MongoDB database deployed on AWS and made it more accessible to the analytics team using MongoDB Compass
• Automated the application to scrape new listings based on a desired update frequency and location code
Currency Reminder
• Created a web-based application for tracking currency exchange rates, featuring scraping of the latest rates and personalized rate alerts for monitoring currency pairs
• Integrated APScheduler to update the latest exchange rates and the Mailgun API to send users notification emails with transactional rate alerts
• Implemented the backend with Python and Flask and the database with MongoDB, and deployed the application with Heroku and mLab
Data modeling with PostgreSQL & Apache Cassandra
• Designed a solution to make data more accessible by building an ETL pipeline that transfers data from two directories into the database
• Created a SQL database with tables designed to optimize queries for song play analysis and migrated to a NoSQL database when requirements changed to support massive writes
• Improved NoSQL table partition read performance by modeling the data to fit target queries
Cloud-based data storage repositories - Data Warehouse & Data Lake
• Designed a cloud-based data warehouse to support data analysis, later transferred to a data lake
• Built an ETL pipeline that extracts data from S3 buckets, stages it in Redshift, and transforms it into a set of dimensional tables and a fact table
• Optimized Redshift query performance by up to 70% with table designs applying different distribution and sorting styles
• Built an ETL pipeline for the data lake with data processed into analytics tables using Spark and deployed on AWS

SKILLS
Programming Languages: Java, Python, JavaScript
Full Stack: Flask, Morphia, Spring, Hibernate, Maven, AngularJS, HTML, CSS
Databases: PostgreSQL, MySQL, MongoDB, Apache Cassandra, AWS Redshift, AWS S3
Data Collection/Storage: Data Modeling, ETL pipeline, Data Warehouse, Data Lake (PySpark)