Data Warehouse

Location:
Evanston, IL, 60203
Posted:
March 09, 2010

Resume:

Ge Peng

**** ***** *****, ********, ** *****

*************@*****.***; 847-***-****; 847-***-****

Data Warehouse and ETL expert with more than 10 years of experience in Information Technology, focusing on the full lifecycle of Data Warehouse and ETL architecture, development, and operation in multiple environments including Oracle, Sybase, SQL Server, and Informix. Specializes in Perl, Shell Script, PL/SQL, and ETL tools. Outstanding at reverse-engineering undocumented Data Warehouse processes, based on excellent profiling and analysis skills, in-depth knowledge, and solid working experience in programming and data architecture. Demonstrated proficiency in using various tools to optimize and automate Data Warehouse operations. Good experience interacting with business users and data vendors.

Technical Expertise

Database: Oracle 9i/10g, Sybase, SQL Server, MS Analysis Services, Informix, PostgreSQL, MySQL;

ETL Tool: DataStage, DTS, Informatica;

Language: Perl, Shell Script, SQL, PL/SQL, T-SQL, XML, C/C++, Java, Visual Basic, ESQL/C, MDX, HTML/DHTML, AWK, ASP, JavaScript, JSP, VBScript;

Software: Toad, SQL Navigator, SQL*Plus, SQL Advantage, dbish, dbaccess, Sybase PowerDesigner, Erwin, Visual Studio, MS Visio, MS Enterprise Manager, ClearCase, ClearQuest, MKS, TWS, AppWorx, cron, Maestro, Documentum, vi, gcc, g++, aCC, Make, CVS, dbx, dbg, HP-DDE, prof, gprof, Syncsort;

CPAN: http://search.cpan.org/~tigerperl/Data-Hierarchy-Traverser-0.01/lib/Data/Hierarchy/Traverser.pm

Professional Experience

Classified Ventures, Chicago, IL

Sr. Data Warehouse Developer 2006 - 2009

Worked in the DW/BI team of the corporate services department of the online advertising joint venture; provided data integration services for the corporation and its three verticals, Cars.com, Apartments.com, and HomeFinder.com.

o Designed and developed ETL processes with PL/SQL, DataStage, Perl, and Unix scripts for integrating data from various data sources, including web activity log files from the three verticals and plain text and XML data files from third-party data vendors. Cooperated with data vendors on XML schema design;

o Reverse engineered existing ETL processes by redesigning PL/SQL packages, Perl scripts, and Unix Shell scripts; reduced process time (most processes shortened from hours to minutes), improved process maintainability, eliminated data errors, provided clear documentation, and enhanced data consistency;

o Migrated hand-coded ETL processes (implemented in PL/SQL, Perl, Shell, and Java) to DataStage;

o Provided tools for automating data transfer, ETL operation, and data quantity monitoring;

o Proposed and assisted with establishing ETL design and test documentation standards.

Independent Consultant, Chicago, IL

Consultant 2004 – 2006

• Data Warehouse/Perl Consultant at Citadel Investment Group, LLC, an international financial institution.

o Worked in the Global Equity team to build ETL applications supporting trading, using Perl, Shell script, and T-SQL to extract, clean, conform, and load data from internal data sources and external data vendors into Sybase IQ;

o Worked in a sole, full-function role developing a decision support system for trading the stock of an internet company (collected requirements from investors, designed the Data Mart architecture in a star schema on Sybase IQ, profiled data sources with Web spider and scrubber techniques, developed ETL processes with Perl, XML, and T-SQL, and provided an OLAP cube as an analysis tool with MS Excel).

• Data Process/Perl Consultant at Walgreens, Co., the nation's largest drugstore chain.

o Built Perl applications to collect and integrate data from OLTP databases and Web logs; transferred data to internal users, external service providers, and end users via FTP, email, and API;

o Collaborated with the web application development team; analyzed the existing build/patch/deploy approach; designed a new deployment process and tools with shell scripts and CVS;

o Worked with the DBA to tune SQL statements with performance issues; built small tools in Perl and Shell script for monitoring operating systems, web servers, database servers, and network traffic; built a web spider and scraper for automated web page testing;

o Worked with business users, the web application development team, and the enterprise data warehouse team on requirement collection and source data analysis; proposed data warehouse integration solutions for e-commerce data processing.

• ETL Consultant at Allstate, Co., an insurance company.

o Worked on two Data Warehouse projects; designed and developed UNIX Shell/Perl scripts and templates to generate dynamic ETL processes (Ab Initio and SQL/PL, SQL scripts) for extracting data from DB2 on Mainframe, cleaning and conforming it, and loading it into Oracle on SunOS;

o Designed and built ETL log data processing tools with Perl, Shell script, and Oracle tables for analyzing ETL performance and data quality;

o Created a tool in Perl to compare Oracle database schemas among development, QA, and production environments, generate an XML file reporting the differences, and generate DDL scripts for the DBA to synchronize the schemas.

• Data Warehouse Consultant at DeepData, Co., a directory data services company in Chino Hills, CA (telecommute).

o Re-designed and built the data warehouse on PostgreSQL with a star schema;

o Created Perl and Bash scripts for fine-grained control of parallel processes over huge volumes of data;

o Analyzed raw data (yellow pages) with Perl and Excel to find data cleansing, correcting, and mapping approaches.

Information Resources, Inc., Chicago, IL

Software Engineer 2000 – 2004

Worked in the Information Technology team of the Market Research department of the enterprise market information solutions and services provider.

o Worked with Microsoft consultants and DBAs on the design of multiple OLAP applications and associated ETL processes; automated ETL processes with Win2k/Win2k3 Advanced Server and SQL Server DTS packages; improved performance by redesigning aggregation methods and applying parallel, distributed computing techniques. One of the applications, built for 7-Eleven, won the First Annual ‘Reinventing CPG Summit’ Awards in 2004;

o Designed and implemented ETL metadata and aggregation metadata repositories for multiple ROLAP/MOLAP applications on Informix, Oracle, Oracle Express, SQL Server, and MS Analysis Services;

o Analyzed IRI's Data Warehouse process systems; profiled information on operating systems, relational database systems, and reporting systems; re-designed and implemented a data collecting and processing tool in Perl; met the business requirement for generating client billing reports by shortening the process time from 18 days to about 10 minutes;

o Analyzed IRI’s special database, InfoView; re-designed the server-side process with new C++ classes for extracting data, and built a new client-side interface with ActiveX objects to generate Excel reports;

o Modified Informix server C code (I-spy) with C++ for SQL statement rewriting to optimize query performance on Informix virtual tables; applied a rule-based technique to improve application maintainability.

Education

North Dakota State University, Fargo, ND 2004

Master of Science in Computer Science

Sichuan University, Chengdu, Sichuan, China 1988

Bachelor of Science in Physics
