Enterprise Data Big

Location:
San Francisco, CA
Posted:
April 23, 2024

Resume:

Enterprise Data/Solution Architect: Technical Leadership/Architect

Introduction

I am a data expert and technical lead in Enterprise Data Strategy, Data Solutions, Enterprise Data Integration (EDI), Enterprise Data Warehouse (EDW), Business Intelligence (BI), Metadata, Data Quality and Master Data Management (MDM), and Data Mining/Analytics/Data Science. My expertise runs from theory to practice, from database to applications, from OLTP to OLAP, from ETL to reporting, from analytics and machine learning to data mining, from MPP systems to the Big Data ecosystem (Hadoop, YARN, MapReduce, Kafka, Spark, etc.), from architecture and DBA work to database development and performance tuning, and from development support to production support.

I am very experienced in large-scale projects across all phases of the development lifecycle, including needs assessment, requirements gathering and definition, enterprise business architecture principles and industry-standard architecture frameworks, and architecture/design and development. I have worked on projects in Enterprise Data Strategy (including VLDB and Big Data) and Data Governance, EDI/EDW and Data Mart architecture and development, MDM and Metadata Repository, ETL/BI tool selection, DW/ETL best practices, and BI/enterprise reporting and analytical applications.

Leadership in all of the following technical areas

Strategies, roadmaps, solution architecture, information system architecture principles and frameworks, information systems security, data/system modeling and development of Data/Database systems, MDM, EDW, Data Marts and Enterprise Metadata, Cloud/AWS EC2 and EMR, Data Lake/Lakehouse, and the Big Data ecosystem, especially their overall system quality, security, performance and optimization.

Overall RDBMS expertise. I have worked on almost all kinds of RDBMS, including Oracle, Teradata, Redshift, Snowflake, Greenplum, SAP HANA, Netezza, DB2/UDB, MS SQL Server and Sybase.

NoSQL and Big Data Ecosystem Infrastructure. Hadoop Cluster, MapReduce and Spark, Kafka and Spark Streaming, Hive, Cassandra and HBase.

ETL systems. I have worked on most of the main ETL tools, including Informatica, Pentaho, DataStage, Talend, Ab Initio and SSIS/DTS, with expertise across Informatica 5 to 10, including BDE/BDM (Big Data Edition/Big Data Management) and the cloud version: Designer/Developer, tasks/workflows, repository, server administration and Metadata Manager.

Business Intelligence, especially on Cognos, Tableau, and Business Objects.

Data profiling, data quality, data standardization, data security, data stewardship and governance (including Big Data with Sentry, Ranger, Cloudera Navigator, IBM Guardium, DgSecure, Atlas, Knox, etc.).

Problem solving and troubleshooting with very strong analytical and logical ability.

Deliver projects with high quality on time and within the budgeted resources.

Skills summary

Information/Data system solution architecture design and management, best practice on architecture principles and frameworks.

Technical leadership on Lakehouse with Databricks and Snowflake including migration strategies, system architecture, data ingestion/processing pipeline design and development, data modeling, database (Snowflake and Redshift) design and DBA.

Solid experience with AWS (S3, Redshift, Spectrum, Athena, Glue, Lambda, etc.), Hadoop/Big Data, Kafka, Spark and NoSQL databases like HBase, Hive and Cassandra.

Efficient with all kinds of database tools (ERwin, Embarcadero ER/Studio and DBArtisan, Oracle Enterprise Manager, Oracle Management Package, SQL Navigator and Toad, DB2/UDB Control Center/Command Center, MS SQL Server Enterprise Manager and Sybase PowerDesigner).

Excellent ETL architecture, design and development with several tools like Informatica PowerCenter (version 5 to 9.6.x), Pentaho/Kettle, BODI and Talend, etc. Informatica PowerExchange, Repository Manager, Metadata Manager and Repository Server administration.

Data sources include, but are not limited to, mainframe VSAM files, flat files, XML files and data stored in all kinds of databases like Oracle, DB2/UDB, Sybase and MS SQL Server.

Specialized skills in Data Warehouse multi-dimensional/star schema design and performance.

Data Warehouse Metadata and Enterprise Metadata Repository design and administration experience.

MDM leadership on Modeling, data quality, data standardization, data security, stewardship and governance.

Training

Cognos suite, Teradata suite, Purisma/SAP PDH MDM.

Business training on Banking, Credit/Risk Management and Basel I and II.

Project management, Agile/Scrum, Six Sigma, Executive Communication, and Professional Service Development and Leadership.

Recent Major Work experience

11/2018—present, Sr. Data architect, United Healthcare

09/2010—11/2018, Consultant/Data/Big Data Architect/Information System Solution Architect

Clients: Facebook, Xcel Energy, GE, BlackRock (the world’s top asset management company), Gap Direct (Retail), eBay, SignalDemand (SaaS services for wholesale and retail food customers like Cargill, Farmland Foods, Hormel, National Frozen Foods and Ventura Foods), Salesforce.com, Fidelity Investments (BI Cloud/SaaS with Birst, Inc.), etc.

The major work in these years includes, but is not limited to:

Roadmaps, POCs, enterprise business and industry standard architecture frameworks and best practices, Data Modeling and Solutions of Data Security, Data Governance, Enterprise Database, Data Warehouse, Data Mart, and ODS (Operational Data Store), Data Lake, Cloud/AWS/EC2 and EMR Systems.

Data lakehouse architecture and data processing with S3, Redshift, Spectrum, Athena, Glue, Lambda, Snowflake, PySpark/Databricks, etc.

Big Data Processing and Analytics with Python, Hadoop, MapReduce, Kafka, Spark (Python and Scala), and NoSQL stores like Cassandra, HBase, Hive, etc.

Database design and development (SQL, Stored Procedures/Packages, database utilities, etc.), logical and physical modeling, performance tuning and production support on Teradata, Oracle and Exadata, MySQL, Redshift, Snowflake, Greenplum/PostgreSQL, SAP HANA, MS SQL Server, etc.

Architecture, ETL and Data Integration for all kinds of data sources, from in-house Customer, Account and Product/Service applications to commercial applications like OBIEE/Siebel Analytics/DAC and Oracle E-Business Suite/Applications (Financials (AP, AR, GL), Order Management (PO), HRMS/HCM), etc.

Design and development of ETL processes (Informatica PowerCenter, BDE/BDM and Cloud, DataStage, Talend, Pentaho Kettle/PDI and BODI/SAP BusinessObjects Data Services) with Type 2 and Type 3 slowly changing dimensions.

Establish ETL Best Practices on all kinds of RDBMS and other data sources.

Performance tuning of the SQL statements used by ETL and BI, including analytic functions.

Earlier Work experience

Dun & Bradstreet. San Francisco, CA (2/2009—09/2010),

Principal consultant/Solution Architect (Director Level)

Clients:

Aon, DHL, Verizon, Cisco, etc.

I have been intensively trained on MDM technologies and solutions that include, but are not limited to, Data Governance, MDM Architecture, Modeling, and Advanced Match and Merge, as well as Professional Service Development and Business and Corporate Leadership.

The main work and responsibilities were:

Lead the whole lifecycle of the Data Solution, Data Governance, Data Security, Master Data Management, Metadata Management, Enterprise Data Integration and BI implementation.

Provide MDM, Data Integration and BI Roadmap, Maturity Analysis/Assessment and solution strategies.

Lead Information/Data Solution Architecting, Designing and Data Modeling of all the related systems.

Provide expertise on database, ETL and BI best practice.

The technologies included, but were not limited to, Oracle 10g-11g, Teradata, Data Warehouse, NoSQL, OBIEE, ETL (Pentaho/Kettle, Informatica and BODI/SAP BusinessObjects Data Services), SAP Information Steward, Metadata Management, and Data Services and Data Abstraction for SOA and MDM/CDI.

COMSYS (11/2002—12/2005), and ABS, Inc. (01/2006—02/2009),

Sr. Manager, Enterprise Data Architecture/Data Warehouse/BI/MDM.

The work in these projects includes, but is not limited to:

Database Systems Architecture, Data Integration and Data Modeling of Enterprise Data, (Active) Data Warehouse, and Enterprise Metadata Management. Data modeling of the dimensional data warehouse, data mart, and ODS (Operational Data Store).

Establish Data Strategy, Policies, Stewardship, and Governance.

MDM/CDI and ETL Best Practice.

Design, deployment and implementation of the MDM Data Hub.

Architecture and ETL/Data Integration for all kinds of data sources, from in-house Customer, Account and Product/Service applications to commercial applications like SAP, OBIEE/Siebel Analytics/DAC, PeopleSoft and Oracle E-Business Suite/Applications (Financials (AP, AR, GL), HRMS/HCM, Order Management (PO), Procurement), etc.

Data source analysis and profiling, covering data from flat files, XML sources, Oracle, DB2/UDB, Sybase, MS SQL Server, Mainframe/VSAM and Teradata, and applications including PeopleSoft Finance/Oracle ERP/Applications, Ventive CRM, SAP CRM, SAP Financial and SAP BW, as well as self-developed applications.

Migration of Oracle, Informatica and Cognos, OBIEE/Siebel Analytics and in-house applications to Teradata.

Standardize ETL design, development, QA, implementation and production support.

Performance tuning of the SQL statements used by ETL, OBIEE/Siebel Analytics and BI (Cognos, Hyperion/Brio and Business Objects), including analytic functions.

Database design and development, logical and physical design, and performance tuning on Oracle, Teradata, Greenplum, UDB/DB2, MS SQL Server, HBase, etc.

Provide guidelines on the architecture of the BI tools and how to leverage the BI functions in the overall enterprise data warehouse systems.

Unix/Korn Shell scripting for database and Informatica command calls.

Production support, troubleshooting and problem solving for the above listed systems, etc.

Clients and Technologies

Clients:

eBay Inc. (the largest Teradata data warehouse system in the world, 2.6 PB).

Merrill Lynch, MetLife, Nomura Securities, DirecTV, IBM, TransUnion, and Orbitz.

Technologies:

ERwin Data Modeler and Model Manager/ModelMart, ER/Studio, PowerDesigner, and Rochade/Adaptive/Informatica MM (Metadata tools).

Utilities and tools for all kinds of databases (Oracle, Teradata, Greenplum, MS SQL Server, etc.).

Hadoop, HBase, and Hive.

Rational Data Architect/Rational Rose/UML, etc.

All kinds of ETL tools (Informatica, Ab Initio, DataStage, Talend, and SSIS, etc.)

All kinds of BI and OLAP tools (Business Objects, Cognos, MicroStrategy, Hyperion/Brio and SSAS, etc.)

IBM Information Server (including WebSphere DataStage, WebSphere Information Analyzer, etc.)

Siperian and Purisma D&B MDM, etc., and FirstLogic for data cleansing.

SuperGlue/Metadata Manager and Rochade, etc.

The Carpe Diem Group (03/01—10/02)

Consultant/ Data Warehouse specialist

Client: ECI Telecom/ECTel

Lead database and data warehouse architect, DBA and developer across the full lifecycle.

Redesign, remodeling and development of the pre-built data warehouse/BI products.

Multi-dimensional data warehouse architecture design with conformed dimensions, facts, data marts, etc. (Kimball methodology).

ETL design and development for the above products for the clients.

Architect, modeler and database developer of the Next Generation QoS system for Voice over IP.

CyberCash (VeriSign, Paypal) (03/00—03/01)

DBA, Data Warehouse Architect and ETL Developer.

Technical Lead of data warehouse projects for CRM and Corporate Financial Reports that include Accounting/GL/AP/AR and P&L.

Meanwhile, I was also an adjunct professor in the Master's program of the Department of Computer Science, University of Northern Virginia. Courses taught included Database Theory, Oracle 8i DBA Certificate, etc.

Data modeling (both relational/OLTP and OLAP/dimensional including dimensions, facts and data marts) and Data Architecture with ERwin, Designer/2000 and PowerDesigner (Kimball Methodology).

Korn Shell Scripts, PL/SQL, SQL*Plus and Oracle Utilities.

Database logical and physical design and performance tuning (Oracle and MS SQL Server).

DBA on Oracle 7.3 to 8.1.6 and SQL Server 6.5 to 2000.

MS OLAP/Analysis Services/MDX dimensions and cubes design and development.

Database installation and configuration, problem solving and troubleshooting, database upgrade and migration.

ORCC (11/98—03/00)

Sr. Software Engineer and DBA.

The company provides online banking and financial services for hundreds of banks of all sizes.

SynQuest (12/97—11/98)

Senior Software Engineer/Product lead. (The company is a public ERP/SCM and Data Analysis/Data Mining Software vendor.)

Highest Education

University of Florida, MS, 1998

Graduate study in Mathematics (Ph.D. on Numerical Optimization), ECE and CIS (MS), University of Florida. Main classes include, but are not limited to, Computer and Systems Engineering (Distributed/Large Scale Database, Digital Image Compression and Processing, Wavelets and Neural Networks, etc.) and Telecom (Networking, Digital/Data Communications and Digital Signal Processing, etc.).


