Data Manager

Location:
Mountain View, CA, 94040
Posted:
March 09, 2010

Resume:

Data warehouse Consultant with over *+ years of experience in IT industry

in Analysis, Design, Development, Testing, Deployment and Support of large

Client/Server Applications and in Data Warehouse projects in the

multi-terabyte range.

> Over 7 years of experience in Information Technology, having in-depth

knowledge in Analysis, Design, Development, Testing and

Production/Maintenance of Client/Server (OLTP) and Data Warehouse (DSS) &

Data Mining Applications. Implemented full life cycle projects in

DW/DSS from beginning to end.

> Six years of experience in the Analysis, Design and Development of Data

warehousing solutions and in developing strategies for Extraction,

Transformation and Loading (ETL) mechanism using Ab Initio for full life

cycle development in building a data warehouse

> Strong knowledge in Data Warehousing concepts, Dimensional modeling like

Star Schema and Snowflake Schema

> Worked extensively on Ab Initio GDE 1.15/1.14x/1.13x/1.11x/1.10, Ab

Initio Co-Operating System 2.15/2.14/2.13x/2.10x.

> Specialized in ETL methodology for supporting Data Analysis, Extraction,

Transformations and Loading, in a corporate-wide-ETL Solution using Ab

Initio.

> Expertise in Ab Initio GDE, Co>Operating System. Created complex Ab

Initio graphs for data processing, data migration and data analysis

purposes.

> Expertise in check pointing/phasing, partition, de-partition, normalize,

sort, replicate, assign keys, rollup aggregate, dedup, reformat, FTP and

other miscellaneous component groups.

> Involved in Unit testing of Ab Initio code, Integration Testing, and

System testing in pre-production environment.

> Used EME for version control and project management.

> Configured graph parameters, sandbox parameters, environment variables

and EME Repository environment for Production/Development and performance

tuning for load graph process.

> Expertise in all components in the GDE of Ab Initio for creating,

executing, testing and maintaining graphs in Ab Initio and also

experience with Ab Initio Co-operating System in application tuning and

debugging strategies.

> Developed several Ab Initio complex graphs for transforming, cleansing &

loading Data marts.

> Developed re-usable Ab Initio Custom Components and extensively worked on

parallel processing.

> Well versed with various Ab Initio parallelism techniques and implemented

Ab Initio Graphs using Data parallelism and Multi File System (MFS)

techniques.

> Developed graphs to fetch data from sources: DB2, Oracle, Teradata, Excel and

flat files.

> Involved in providing production support to various Ab Initio ETL jobs

and Teradata Database programs.

> Strong knowledge of Data Warehousing concepts and excellent programming

skills with ability to automate routine tasks using UNIX shell scripting

and good experience in SQL.

> Worked with multi files, partitions and various kinds of joins in

Massively Parallel Processing (MPP) environment with very large Databases

(VLDB).

> Proven experience in Relational Databases (RDBMS) - Teradata V2R5/R4

basics, Teradata SQL and its utilities and tools, Oracle databases

10g/9i/8i/8.x.

> Strong working experience in Teradata utilities (BTEQ, FASTLOAD,

FASTEXPORT, MULTILOAD, Teradata Administrator, SQL Assistant, PMon), UNIX

scripting, PL/SQL, SQL, SQL*Plus, SQL*Loader, Bulk Copy (BCP), Stored

Procedures, Functions, Packages.

> Designed and developed Load/Unload Teradata Utility Scripts: MLOAD,

FASTLOAD, FASTEXPORT.

> Worked in Very Large Databases (VLDBs), massive volume data, Table

Partitions, Tablespace, Capacity and I/O Management

> Developed various UNIX shell wrappers to run Ab Initio and database

jobs. Practical experience working in multiple environments such as

production, development and testing.

> Experience in integration of various data sources with Multiple

Relational Databases like Teradata, Oracle, SQL Server, and MS Access and

worked on integrating data from Flat files.

> Proficiency in data warehousing techniques for data cleaning, Slowly

Changing Dimension phenomenon, surrogate key assignment and CDC (change

data capture).

> Extensive experience in Operating Systems: Sun Solaris UNIX, HP-Unix, AIX

UNIX, Red Hat Linux & Windows XP/2000/NT/98/95.

> Unique ability to understand long-term project development issues at

all levels, from interpersonal relationships to the details of coding

scripts with strong analytical, organizational, presentation and problem

solving skills.

TECHNICAL SKILLS:

ETL Tools: Ab Initio (Co-Operating System 2.15/2.14/2.13x/2.10x, GDE 1.15/1.14/1.13x/1.11x/1.10), EME, Informatica (Power Mart/Power Center) 8.x

Reporting Tools: ReportNet, Framework Manager, CRNSDK, Report Studio, Query Studio, Cognos Connection, Impromptu, Cognos Query, Metrics Manager, Access Manager, Cognos Configuration

Operating Systems: Sun Solaris (8, 9, 10), Linux, HP UNIX, IBM AIX, Windows NT

Databases: Teradata, Oracle 10g, 9i, 8i, DB2-UDB

Languages: SQL, PL/SQL, Shell Scripting

Database Tools: BTEQ, FASTLOAD, FASTEXPORT, MULTILOAD, Teradata Administrator, SQL Assistant, PMon, Oracle Enterprise Manager (OEM), Data Guard, Backup and Recovery Manager (RMAN), PL/SQL, SQL*Plus, Export and Import Utility, Oracle Tuning Tools (STATSPACK, SQL TRACE, TKPROF, Explain Plan)

PROFESSIONAL EXPERIENCE:

EURAMAX, GA

Jan 2009 - Till Date

Sr. Ab Initio Developer

This project consisted of maintaining the customer details of EURAMAX:

customer transactions, sales and credits. This data is updated biweekly.

The data from flat files is cleaned and the required DML structural changes

are made. Automated Test Release Coding is done in order to automate the

entire system.

Responsibilities:

. Involved in understanding the Requirements of the end Users/Business

Analysts and Developed Strategies for ETL processes.

. Responsible for the detailed design and documentation

. Provided technical solutions for the Process requests raised by Data

team to fix the issues in the existing system.

. Designed, developed and Unit tested Ab Initio graphs using GDE for

Extraction, Transformation and Loading of data from source to target.

. Extensively used Database and Dataset components like Input file,

Input table, and Output table and transform components like Join,

Rollup, Scan, Filter by expression, Reformat and other components like

Merge, Lookup, Input/output table, Dedup and Sort

. Implemented component level, pipeline and data parallelism using Ab

Initio for ETL process.

. Extensively involved in performing EME dependency analysis.

. Used Partition components like partition by expression, partition by

key, etc., to run the middle layer processing in parallel.

. Extensively used various inbuilt transform functions like

string_substring, string_lpad, string_index, lookup functions, date

functions, error functions.

. Extensively used Ab Initio Co>OP commands like m_ls, m_wc, m_dump,

m_copy, m_mkfs, etc.

. Utilized multi file system (MFS) to execute the graph in parallel.

. Extracted data from Teradata Data source tables and created various

lookups like commitment type lookups, and security type lookups

. Worked on improving performance of Ab Initio graphs by using various

Ab Initio performance techniques like using lookups, in memory joins

and rollups to speed up various Ab Initio graphs.

. Implemented ETL processes. Created a loading environment: wrote UNIX

and PL/SQL scripts to create and drop database objects and to

interface with the MultiLoad/FastLoad utilities of Teradata.

. Designed, developed and tested Teradata Load/Unload Utility

Scripts: MLOAD, FASTLOAD and FASTEXPORT.

. Designed and developed parameterized generic graphs

. Extensively worked in the UNIX environment using Shell Scripts.

. Logged and resolved defects in the roll out phase.

. Improved the performance of viewing the Baseline and Actual data from

20 minutes to less than 2 minutes by moving some functionality from Ab

Initio to Teradata.

. Involved in improving the online response time by re-writing optimized

queries and by advising new Primary Indexes on the tables and also in

adding new secondary indexes

. Closely monitored the Control-M batch jobs in ETL batch run during

System, Integration and Acceptance test runs

Environment: Ab Initio GDE 1.15, Co>Operating System 2.15, Control-M,

UNIX, TERADATA, Oracle, BTEQ, Teradata SQL Assistant v6.5.

NCSBN, IL

Nov 2007 - Dec 2008

Sr. Ab Initio Developer

The project involves implementing outgoing interfaces for the customer group

using Ab Initio. It also involves implementing mapping of the sources from

legacy databases to an Oracle database. This system will support NCSBN

customers all over the United States.

Responsibilities:

. Involved in analyzing business needs and documenting functional and

technical specifications based upon user requirements with extensive

interactions with business users.

. Extracted and reformatted data from different heterogeneous data

sources like DB2, SQL Server and Oracle using Ab Initio.

. Developed Ab Initio graphs using various components like Transform,

Partition, Repartition, Database and Datasets.

. Improved the performance of graphs using performance tips and making

use of the parallelism features of Ab Initio.

. Worked with several transformation components like Rollup, Scan,

Normalize, Denormalize and Reformat to transform the data in the

required format.

. Extensively worked with the Ab Initio Enterprise Meta Environment (EME)

to obtain the initial setup variables and maintaining version control

during the development effort.

. Implemented partition techniques using Partition by Key, Partition by

Expression and Round-Robin techniques on the data unloaded from

multiple tables before sending the data through data quality checks.

. Improved graph performance by eliminating repeated partition/sorts and

unnecessary data landing.

. Implemented data parallelism utilizing MFS in the graphs: the Ab Initio

partition components divide the data into segments and each segment is

processed simultaneously.

. Used phases and checkpoints in Ab Initio graphs to facilitate recovery

after failures

. Responsible for deploying Ab Initio graphs as Korn shell scripts.

. Created packages to deploy various .mp, .ksh, .dml, .xfr, and .dbc from

development to test and production environment.

. Involved in developing UNIX Korn Shell wrappers to initialize

variables, run graphs and perform error handling.

. Used Autosys for job scheduling.

Environment: Ab Initio (GDE 1.14, Co>Operating System 2.14), Oracle 9i,

DB2, SQL Server 2000, MS-Access, SQL, PL/SQL, UNIX, Shell scripting,

Windows 2000, Autosys.

Sallie Mae, IN

Nov 2006 - Oct 2007

Ab Initio Developer

The purpose of this project is to provide an IT-owned data warehouse for

the Data Analytics team. There was a reporting system in place, but the

data sources are stored in various systems and pulled together in an

inconsistent manner. The goal is to provide a consistent interface for the

reporting and analysis that is required for Sallie Mae. This project will

utilize Ab Initio to normalize the data and Business Objects as the reporting

tool for consistent reports for all users. The backend storage will

leverage the existing EDW environment to avoid extra costs and to

use existing system expertise.

. Analyzed Business and Accounting requirements from the Accounting and

Business Detail level Process design.

. Involved in understanding the Requirements of the end Users/Business

Analysts and Developed Strategies for ETL processes.

. Responsible for the detailed design and documentation. Provided

technical solutions for the Process requests raised by Data team to fix

the issues in the existing system.

. Designed, developed and Unit tested Ab Initio graphs using GDE for

Extraction, Transformation and Loading of data from source to target.

. Extracted data from Teradata Data source tables and created various

lookups like commitment type lookups, and security type lookups.

. Involved in writing test cases to validate the code changes.

. Extensively used Database and Dataset components like Input file, Input

table, and Output table and Transform components like Join, Rollup,

Scan, Filter by expression, Reformat and other components like Merge,

Lookup, Input/output table and Sort.

. Implemented component level, pipeline and data parallelism using Ab

Initio for ETL process.

. Extensively involved in performing EME dependency analysis.

. Involved in Ab Initio upgrade from GDE 1.13 to GDE 1.14.35.

. Used Partition components like partition by expression, partition by

key, etc., to run the middle layer processing in parallel.

. Extensively used various inbuilt transform functions like

string_substring, string_lpad, string_index, lookup functions, date functions,

error functions.

. Extensively used Ab Initio Co>OS commands like m_ls, m_wc, m_dump,

m_copy, m_mkfs, etc. Utilized multi file system (MFS) to execute the

graph in parallel.

. Worked on improving performance of Ab Initio graphs by using various Ab

Initio performance techniques like using lookups, in memory joins and

rollups to speed up various Ab Initio graphs. Designed and developed

parameterized generic graphs.

. Closely monitored the Autosys batch jobs in ETL batch run during System,

Integration and Acceptance test runs.

. Extensively worked in the UNIX environment using Shell Scripts. Created

test cases and performed unit testing for the Ab Initio graphs.

Documented Unit testing. Logged and resolved defects in the roll out

phase

. Created a Production support document and documented the Test case work

book, High level Design and Detail Design documents.

Environment: Ab Initio (GDE 1.13, Co>Op 2.13), Teradata, Teradata SQL

Assistant, BTEQ, UNIX, SQL, TOAD, Windows XP, SQL*Plus, Business Objects.

Independent Health, Buffalo, NY

Nov 2004 - Oct 2006

Ab Initio Developer/Data Analyst

Responsible for the design, implementation and maintenance of the customer data

mart and for running reports at Independent Health. The project involves pulling

large amounts of customer data from various disparate data sources,

operational data store (ODS) and OLTP systems with DB2 database sources,

performing data cleansing and transformation operations on the data and

finally loading it into an Oracle database as the target.

Responsibilities:

. Involved in designing the Data model.

. Requirement collection for the ETL processes of the merchant data mart.

. Involved in the high level design of the data mart (Snowflake Schema).

. Analyzed the systems, met with end users and business units in order to

define the requirements.

. Documented user requirements, translated requirements into system

solutions and developed the implementation plan and schedule.

. Interpreted logical and physical data models for Business users to

determine common data definitions and establish referential integrity of

the system.

. Extracted, transformed and loaded data from operational data sources into

target Oracle database.

. Created sandbox utility using shell scripts for individual development

and testing environment. Also created multi-file system using shell

scripts to store partitioned data on different nodes (SMP Architecture).

. Responsible for data validation and verification of business

transformation rules.

. Used Ab Initio partitioning and filtering components to achieve parallel

data transformations and filtering.

. Used Partition by Key & Sort, Partition By Expression, Filter By

Expression, Partition By Round Robin, Sort, Reformat, Join, Rollup,

Gather, Redefine Format, Replicate, Lookup File, Generate Records,

Concatenate, Scan, Normalize components to develop the ETL transformation

logic.

. Deployed the Ab Initio graphs as Korn shell scripts and invoked them

using wrapper scripts.

. Scheduled jobs using Maestro.

. Prepared and implemented data verification and testing methods for the

data warehouse and involved in the unit testing.

. Extensively worked in the performance tuning of the programs, ETL

Procedures and processes.

. Provided technical assistance and support to IT analysts and business

objects users.

Environment: Ab Initio 1.12, DB2, Oracle 9i, Erwin 4.0, Business Objects,

SQL, PL/SQL, UNIX shell scripting, AIX 4.3, Windows 2000/NT

Housing Development Finance Corporation (HDFC), India

May 2003 - Oct 2004

ETL Developer

I was a part of the team working on the Housing Development Finance

Corporation (HDFC) project. The project involved pulling large amounts of

merchant data from various disparate data sources, operational data store

(ODS) and OLTP systems with Oracle database sources, performing data

cleansing and transformation operations on the data and finally loading it

into Oracle targets such as bankruptcy filing, creditline_decrement,

Aggregate Financial, GL_fiscal, GL_Journal_Entry, etc.

Responsibilities:

. Extensively worked on the Ab Initio GDE Component Organizer. Designed and

implemented Ab Initio graphs and subgraphs using various components such as

dedup sorted, partition by key, reformat, filter by expression, gather,

merge, etc.

. Extensively involved in Ab Initio Graph Design, development and

Performance tuning.

. Developed Data transformation, loading, scrubbing and extraction Programs

using Ab Initio.

. Used inquiry and error functions like is_valid, is_error, is_defined and

string functions like string_substring, string_concat and other string_*

functions in developing Ab Initio graphs to perform data validation and

data cleansing.

. Implemented Data Parallelism through graphs, by using Ab Initio partition

components.

. Developed parameterized generic graphs for data extraction and load.

. Worked on system testing and troubleshooting defects.

. Prepared thorough documentation for the code and test strategies.

Environment: Ab Initio, Oracle 8i, PL/SQL & Visual SourceSafe.

ICICI Infotech, India

March 2002 - April 2003

Oracle Developer

Designed and developed the order processing system, which handles all

the orders from dealers. It starts from the dealer order and ends at invoicing.

Responsibilities

. Created several Procedures, Functions, Packages and Database Triggers to

implement the functionality in PL/SQL and Database Administration

. Designed the User Interfaces using Forms.

. Performed coding and testing of the Application

. Interacted with the users during testing phase.

. Involved in Training of application to Users.

. Created Tables and relationship among the tables so as to maintain the

Referential Integrity.

. Worked with Table constraints, Views and Sequences.

. Wrote Stored Procedures, Database triggers.

. Created Menus and indexes.

. Prepared User Manual.

Environment: Windows 98, Oracle 7.1, Forms 3.0, SQL*Plus 8.0, PL/SQL,

Developer 2000


