Jason Zhang Senior Data Engineer
Foster City, CA 94404. 650-***-**** ac02l0@r.postjobfree.com
Summary
Data engineer available to work immediately in the San Francisco Bay Area on an L2 visa EAD.
- 10 years of experience in data modeling, data warehouse, pipeline, and visualization development.
- 3 years of experience as a data scientist applying statistics, data mining, and machine learning to e-commerce and manufacturing business problems.
Technical Skills
Programming: HiveQL, SQL, R, Python, Shell, Java, JavaScript, C#, C++
BI/Big Data: Hive, Sqoop, Oozie, Hadoop, Spark, Impala, HBase, Kibana, Elasticsearch, Splunk
Tableau, QlikView, SAP BW/BOE/CX/CR/WebI/Voyager/DQ, Informatica PowerCenter
Professional Experience
Jun 2016 – Mar 2017 • Ffan E-Commerce • Principal Data Engineer, Data Scientist
Responsible for analyzing billions of rows of data and deriving actionable insights across 6,000 plazas, 100,000+ sellers, and 180 million customers, helping Ffan measure app performance and monitor trends:
- presented key findings derived from data to executives bi-weekly.
- designed a dimensional data model; developed 30+ Tableau dashboards and 10+ Kibana analytics.
- developed Sqoop jobs to transfer data between MySQL and Hive.
- created Oozie workflows to automate Sqoop jobs.
- imported data into R/Python and created training and test sets.
- created customer segmentation and fraud monitoring models using decision-tree machine learning.
Environments: Tableau, Hive, Sqoop, Oozie, Hadoop, RStudio, Python/Spyder, Kibana, Elasticsearch
Award: Outstanding Employee, 2016
Apr 2014 – Jun 2016 • COMAC • Senior Data Engineer, Data Scientist
Responsible for developing data analytics to gain insight into airplane suppliers, millions of parts, stock, manufacturing, quality, machines, finance, and HR:
- built charts in Unity3D with D3.js, accessing the Hive server via Thrift in Java.
- implemented a dimensional data model in an Oracle data warehouse; developed 50+ dashboards and WebI reports.
- wrote scripts for Informatica ETL jobs to integrate business data into the data warehouse.
- created supplier segmentation using K-means clustering.
- developed Sqoop import/export jobs to build a pipeline between the data warehouse and Hive.
Environments: Hive, Sqoop, Oozie, Hadoop, RStudio, Python/Spyder, SAP Business Objects, Oracle
Award: Excellent Project Manager, 2015 / Excellent QC team, 2015 / Outstanding Employee, 2014
Apr 2012 – Feb 2014 • Van Hessen BV • Data Engineer
Responsible for developing statistical reports for the finance, production-planning, sales, and purchasing teams, providing actionable insight into ERP data using QlikView on top of Oracle.
Aug 2010 – Apr 2012 • Hewlett-Packard Enterprise Service • BI Engineer
Responsible for developing BI solutions using WebI and Crystal Reports, and for building pipelines with Connect-IT to integrate IT service and device data, helping the IT team improve service and infrastructure management.
Jul 2006 – Aug 2010 • SAP Business Objects • BI Product Escalation Engineer
Responsible for product issue management and advanced troubleshooting of BI products including SAP BW, BOE, Crystal Reports, Web Intelligence, DeskI, Xcelsius, and Voyager.
Publication
Oct 2004, book, “Beginning Game Programmers in ASP”, Tsinghua University Press
Education
Sep 2003 – Jul 2006 (3 years), Chinese Academy of Sciences, Master's in Computer Application
Sep 1999 – Jul 2003 (4 years), National University of Defense Technology, Bachelor's in Computer Science