Post Job Free

Resume

Sign in

Data Engineer

Location:
Cupertino, CA
Posted:
February 11, 2018

Contact this candidate

Resume:

Joy Cheng (U.S. Citizen)

408-***-**** ac4fq7@r.postjobfree.com http://www.linkedin.com/in/joycheng4bi

Extensive experience solutions-oriented Data Engineer with proven success designing, implementing, and integrating high-performance ETL solutions to BI processes. Serve business functions in different domains including Finance, Digital Marketing/ Advertising. Possess unique combination of technical and analytical skills with demonstrated success and proficiency in program management.

AREAS OF EXPERTISE

Data Engineer

Big Data technologies: Hadoop/Hive/Presto/Vertica

Data Modeling, Data Warehousing, Data Mining in Vertica, Hive and SSIS

Solid understanding of data structure and common algorithms

Proficient in SQL-based languages

Data Visualization – Tableau, Google Data Studio

Microsoft certified MS SQL Server Business Intelligence Developer

Experienced with SAP Business Objects, Adobe Omniture

Programming languages – Python, R, C#, Linux Bash scripting, Batch scripting

Program Manager/ Business Analyst

PMI Program Management

Took ownership of business process while IT designs the system, and drove common themes across processes

Prioritized projects and deliverables to meet deadlines, and executed plan of records

Familiar with Agile Methodologies

Completed internal SPC, 6 Sigma Green Belt training and passed Qualifying Exams

Completed PMP training courses, 102 PDUs

WORK EXPERIENCE

FACEBOOK (via Creospan), Menlo Park, CA 2016 – 2018

Data Engineer, EDW

I was brought to the team as a support Data Engineer to SMB Analytics Operation team. I own the design, development, and maintenance of ongoing metrics, reports, analyses, dashboards to support SMB Business (analysts/data scientists) for driving key business decisions. I implement Dataswarm pipelines to support ETL processes to Vertica and to HDFS. Oversee hundreds of pipelines and quickly pinpoint root cause for failure and implement fix to ensure Business has no downtime. Ensure SLA being met on a consistent basis (both dashboards and pipelines) by optimizing the performance of business-critical queries with low latency. Implement DQ tasks in each pipeline to improve data quality and integrity. Participate in a pipeline framework migration project and bring the result of code standardization and scalability. The migration project transitioned the current data architecture from Vertica to an in-house open-source distributed SQL query engine (Presto) built on top of Hive.

GOOGLE (via Akorbi), Mountain View, CA 2014 - 2016

Integration (Data) Engineer, Google Analytics 360 (2015 - 2016)

Owned Python data integrate loaders with embedded SQL to load client data, and to implement complex and critical engagements to configure Attribution Analytics software. Prep data sets and worked with Data Scientists on data modeling and software algorithm. Client facing role to understand reporting requirements to support ETL processes. Implemented frequent change requests in high-speed environment, and ensured changes were consistent with initial requirements for attribution strategies. Loaded complex customer data into Dremel (BigQuery) with accurate data mapping and successful reporting generation. Wrote MongoDB report definition using YAML. Converted/ Maintained legacy Vertica data warehouse and a legacy UI into BigTables and Google Dashboards.

Business Analyst, Marketing Finance (2014 - 2015)

Designed, implemented, improved and maintained revenue dashboards to track key performance metrics and developed insights for B2B marketing initiatives to support AdWords Marketing team. Oversaw budget requests, minimized spend variance against target, and partnered with accounting on expense recognition. Supported ad hoc requests on revenue analytics and expense management.

Helped a finance analyst to retrieve critical data by providing an ad hoc report when one of the online reports was no longer refreshed. The ownership of the backend query was lost and unable to recover due to previous owners leaving the company. The ad hoc report enabled the analyst to perform critical financial planning for the next year.

Global Marketing Director questioned “coupon abusers” numbers on Revenue Dashboard I owned since it was still showing coupon usage from a channel that had not issued coupons for more than a year. He also urged local marketers to decrease coupon abuser rate. By using other upstream data sources for my investigation, the result corresponded with the dashboard numbers. Further uncovered an issue of coupon issue country was recorded wrong. Director agreed with my finding, and switched the team focus to more pressing business needs.

After only 1-month onboarding with the precedent left company, mentored a Noogler getting on speed. Took ownership on entire Marketing Monetization Reporting pipelines.

HP, Sunnyvale, CA 2007 – 2014

Business Intelligence Engineer (2007 – 2012)

Business facing role in gathering requirements and reporting goals, and to develop metrics dashboard for near real-time reporting and forecasting. Created complex SQL stored procedures and UDF functions for large data set in OLAP/OLTP. Implemented Data Warehouse, which included ETL data between different data sources (Oracle, Excel, SQL Server, log files) using SSIS, SSAS 2008. Analyzed data and presented to Business owners and upper management. Performed Data modeling using Star schema and built OLAP cubes. Implemented solutions in Agile environment.

Built OLAP cubes for business owners to slice and dice reports to fit business needs such as software usage trend base on Region/ Country/ SW titles/ ROI according to marketing questions. Saved company 10-day man hours in developing different reports for each need.

Prepared Ad Hoc reports for account manager to prove to the partner that reports generated at our side were more accurate. Eliminated doubt regarding number discrepancy between HP and partners.

Investigated abnormal redirector traffic data. Discovered that software program appended un-wanted strings to the pre-set target URLs. Identified and engaged related personnel for an emergency fix, and saved company from negative public reviews on the first day of product launch.

Developed online report dashboard using SSRS 2008 to assist business owners making data-driven decisions whether to continue or discontinue products. Enabled organization to better utilize marketing budget.

Evaluated various BI products including SAP Business Objects, and played key role in saving $100k from un-necessary purchase of new license, systems and personnel trainings.

Software Release Program Manager (2012 – 2014)

Executed cross-functional processes to release HP in-house and 3rd-party software onto on-built image. Ensured software quality, schedules, and delivery compliance of software and services. Communicated regular status updates. Maintained bug tracker watch lists. Drove to solve issues proactively with internal/external customers. Scripted installation package for software delivery.

Program manager for over 10 3rd-party deliverables and 2 internal products, including HP Connected Drive.

Coordinated across various engineering organizations (Dev, QA, SW/ HW owners, GBUs, ODMs, vendors) ensuring products met requirements and delivery schedule.

Successfully managed and communicated both domestically and off-shore with multiple time-zones.

Facilitated weekly Core Team meeting, which improved liaison and enhanced working relationships between SW Delivery Team and GBUs.

Influenced project / product planning to meet release schedules and contract compliance.

Identified an issue with vendor SW not shown on image, raised attention of appropriate personnel and developed an emergency resolution that saved HP over $2 million for not breaking the Agreement with Microsoft.

Collaborated and worked closely with product organizations to augment revenues and enhance margins by promoting HP and 3rd party software and services.

DIRECT LOGIC SOLUTIONS, Peoria, IL 2007

Online Marketing firm that helps clients effectively reach targeted customers.

Business Intelligence Developer

Assisted clients (Hasbro, Discover Card, etc.) in developing metrics dashboards with near real-time reporting and forecasting using SSRS. Prepared Ad Hoc reports upon clients’ requests. Implemented stored procedures and UDF for live data reports in SQL Server Reporting Service.

Worked with statistics professor in local college to perform Customer Segmentation Analysis. Pinpointed customer’s demographic data related to products purchased/ showed interest, and forecasted type of toys customer would be likely to purchase in the future. Played key role in assisting marketing team to tailor internet marketing strategy for Hasbro.

INNOLUX DISPLAY CORP. (Foxconn) Chu-Nan, Taiwan 2005 - 2006

Leading LCD manufacture in Taiwan.

Business Intelligence Developer for Supply Chain Planning

Developed Analytics systems for keeping track of production material integrity using SQL Server, Oracle and ASP.NET. Created various management tools written in C#.

Created web-based ERS-MES monitoring system. Provided complete view of real time inventory in LCD fab, warehouse, and WIP. Showed trend charts on ERP-MES start and ending differences in selected period. Improved start and ending material difference from > 50% to < .5%

Created auto-mail system to send out daily abnormalities to different user groups based on root causes. Improved MES-ERP STD close to 5X

Created Web Work Order Monitoring System to monitor consistency of WIP data between MES and ERP by analyzing work order data. Quantity difference of WIP between MES and ERP dropped from 6% to 0.3% for each plant

NATIONAL INSTITUTE OF HEALTH, Bethesda, MD 2003 - 2005

Associate Investigator/ MATLAB Developer

Assisted in generating objectives for embryo development research protocols. Designed relational database using MS Access for managing study subjects’ information and research results. Analyzed data using MS Excel and SPSS.

Utilized background in bioengineering and computer science to develop MATLAB app. associated with other stand-alone SW for calculating/analyzing fetal respiratory Doppler’s images. This MATLAB program played a key role in determining genetic disease before birth using correlation data of historical and collected data. Co-authored the research result in an article published to American Journal of Obstetrics and Gynecology (2003)

EDUCATION

M.S. in Computer Science, University of Illinois at Springfield

B.S. in Bioengineering, University of Illinois at Urbana-Champaign



Contact this candidate