Data warehouse Consultant with over *+ years of experience in IT industry
in Analysis, Design, Development, Testing, Deployment and Support of large
Client Server Applications and in Data Warehouse projects in multi
terabytes range.
> Over 7 years of experience in Information Technology, having in-depth
knowledge in Analysis, Design, Development, Testing and
Production/Maintenance of Client/Server (OLTP) and Data Warehouse (DSS) &
Data Mining Applications. And Implemented Full life Cycle Projects in
DW/DSS from begin to an end.
> Six years of experience in the Analysis, Design and Development of Data
warehousing solutions and in developing strategies for Extraction,
Transformation and Loading (ETL) mechanism using Ab Initio for full life
cycle development in building a data warehouse
> Strong knowledge in Data Warehousing concepts, Dimensional modeling like
Star Schema and Snowflake Schema
> Worked extensively on Ab Initio GDE 1.15/1.14x/1.13x/1.11x/1.10, Ab
Initio Co-Operating System 2.15/2.14/2.13x/2.10x.
> Specialized in ETL methodology for supporting Data Analysis, Extraction,
Transformations and Loading, in a corporate-wide-ETL Solution using Ab
Initio.
> Expertise in Ab Initio GDE, CO>operating System. Created complex Ab
Initio graphs for data processing, data migration and data analyzing
purpose.
> Expertise in check pointing/phasing, partition, de-partition, normalize,
sort, replicate, assign keys, rollup aggregate, dedup, reformat, FTP and
other miscellaneous component groups.
> Involved in Unit testing of Ab Initio code, Integration Testing, and
System testing in pre-production environment.
> Used EME for version control and project management.
> Configured graph parameters, sandbox parameters, environment variables
and EME Repository environment for Production/Development and performance
tuning for load graph process.
> Expertise in all components in the GDE of Ab Initio for creating,
executing, testing and maintaining graphs in Ab Initio and also
experience with Ab Initio Co-operating System in application tuning and
debugging strategies.
> Developed several Ab Initio complex graphs for transforming, cleansing &
loading Data marts.
> Developed re-usable Ab initio Custom Components and extensively worked on
parallel processing.
> Well versed with various Ab Initio parallelism techniques and implemented
Ab Initio Graphs using Data parallelism and Multi File System (MFS)
techniques.
> Developed graphs to fetch data from sources: DB2, Oracle, teradata, excel
flat files.
> Involved in providing production support to various Ab Initio ETL jobs
and Teradata Database programs.
> Strong knowledge of Data Warehousing concepts and excellent programming
skills with ability to automate routine tasks using UNIX shell scripting
and a good experience in SQL.
> Worked with multi files, partitions and various kinds of joins in
Massively Parallel Processing (MPP) environment with very large Databases
(VLDB).
> Proven experience in Relational Databases (RDBMS) - Teradata V2R5/R4
basics, Teradata SQL and its utilities and tools, Oracle databases
10g/9i/8i/8.x.
> Strong working experience in Teradata utilities (BTEQ, FASTLOAD,
FASTEXPORT, MULTILOAD, Teradata Administrator, SQL Assistant, PMon), UNIX
scripting, PL/SQL, SQL, SQL*Plus, SQL*Loader, Bulk Copy (BCP), Stored
Procedures, Functions, Packages.
> Designed and developed Load/Unload Teradata Utility Scripts: MLOAD,FAST
LOAD,FAST EXPORT
> Worked in Very Large Databases (VLDBs), massive volume data, Table
Partitions, Tablespace, Capacity and I/O Management
> Developed various UNIX shell wrappers to run Ab Initio and Data base
jobs. Practical experience with working on multiple environments like
production, development, testing.
> Experience in integration of various data sources with Multiple
Relational Databases like Teradata, Oracle, SQL Server, and MS Access and
worked on integrating data from Flat files.
> Proficiency in data warehousing techniques for data cleaning, Slowly
Changing Dimension phenomenon, surrogate key assignment and CDC (change
data capture).
> Extensive experience in Operating Systems: Sun Solaris UNIX, HP-Unix, AIX
UNIX, Red Hat Linux & Windows XP/2000//NT/98/95.
> Unique ability to understand and long-term project development issues at
all levels, from interpersonal relationships to the details of coding
scripts with strong analytical, organizational, presentation and problem
solving skills.
TECHNICAL SKILLS:
ETL Tools AB INITIO (Co-Operating System
2.15/2.14/2.13x/2.10x,
GDE 1.15/1.14/1.13x/1.11x/1.10), EME,Informatica
(Power Mart/Power Center) 8.x.
Reporting Tools ReportNet,FrameworkManger,CRNSDK,ReportStudio,
Query Studio, Cognos Connection, Impromptu, Cognos
Query, Metrics Manger, Access Manager, Cognos
Configuration.
Operating Systems Sun Solaris (8,9,10), LINUX, HP UNIX,IBM AIX,
Windows NT
Databases Teradata, Oracle 10g, 9i, 8i,DB2-UDB
Languages SQL,PL/SQL, Shell Scripting
Database Tools BTEQ, FASTLOAD, FASTEXPORT, MULTILOAD, Teradata
Administrator, SQL Assistant, PMon Oracle
Enterprise Manager (OEM), Data Guard Backup and
Recovery Manager (RMAN), PL/SQL, SQL*PLUS, Export
and Import Utility, Oracle Tuning Tools
(STATSPACK, SQL TRACE, TKPROF, Explain Plan),
PROFESSIONAL EXPERIENCE:
EURAMAX, GA Jan 2009 - Till
Date
Sr. Ab Initio Developer
This project consisted of maintaining the customer details of EURAMAX;
customer's transactions, Sales and Credits. This data is updated by-weekly.
The data from flat files are cleaned and the required dml structural change
is made. Automated Test Release Coding is done in order to automate the
entire system
Responsibilities:
. Involved in understanding the Requirements of the end Users/Business
Analysts and Developed Strategies for ETL processes.
. Responsible for the detailed design and documentation
. Provided technical solutions for the Process requests raised by Data
team to fix the issues in the existing system.
. Designed, developed and Unit tested Ab Initio graphs using GDE for
Extraction, Transformation and Loading of data from source to target.
. Extensively used Database and Dataset components like Input file,
Input table, and Output table and transform components like Join,
Rollup, Scan, Filter by expression, Reformat and other components like
Merge, Lookup, Input/output table, Dedup and Sort
. Implemented component level, pipeline and data parallelism using Ab
Initio for ETL process.
. Extensively involved in performing EME dependency analysis.
. Used Partition components like partition by expression, partition by
key, etc., to run the middle layer processing parallel.
. Extensively used various inbuilt transform functions like string
substring, string_lpad, string index, lookup functions, date
functions, error functions.
. Extensively used Ab Initio Co>OP commands like m_ls, m_wc, m_dump,
m_copy, m_mkfs, etc.
. Utilized multi file system (MFS) to execute the graph parallel.
. Extracted data from Teradata Data source tables and created various
lookups like commitment type lookups, and security type lookups
. Worked on improving performance of Ab Initio graphs by using various
Ab Initio performance techniques like using lookups, in memory joins
and rollups to speed up various Ab Initio graphs.
. Implemented ETL processes. Created a loading environment: Wrote Unix
and PL/SQL scripts to create and drop database objects and to
interface with Multiload /Fastload utilities of Teradata.
. Designing, developing and testing Teradata Load/Unload Utility
Scripts: MLOAD, FAST LOAD, And FAST EXPORT.
. Designed and developed parameterized generic graphs
. Extensively worked in the UNIX environment using Shell Scripts.
. Logged and resolved defects in the roll out phase.
. Improved the performance of viewing the Baseline and Actual data from
20 minutes to less than 2 minutes by moving some functionality from Ab
Initio to Teradata.
. Involved in improving the online response time by re-writing optimized
queries and by advising new Primary Indexes on the tables and also in
adding new secondary indexes
. Closely monitored the Control-M batch jobs in ETL batch run during
System, Integration and Acceptance test runs
Environment: Ab Initio GDE 1.15. Co>Operating System 2.15, Control M,
UNIX, TERADATA, Oracle, BTEQ, Teradata SQL Assistant v6.5.
NCSBN, IL Nov 2007 - Dec
2008
Sr.Ab Initio Developer
The project involves implementing outgoing interfaces for customer group
using Ab Initio. It also involves implementing mapping of the sources from
Legacy databases to Oracle database. This system will support the NCSBN
customers for all over the United States.
Responsibilities:
. Involved in analyzing business needs and document functional and
technical specifications based upon user requirements with extensive
interactions with business users.
. Extracted and reformatted data from different heterogeneous data
sources like DB2, SQL Server and Oracle using Ab Initio.
. Developed Ab Initio graphs using various components like Transform,
Partition, Repartition, Database and Datasets.
. Improved the performance of graphs using performance tips and making
use of parallelism features of Ab initio.
. Worked with several transformation components like Rollup, Scan,
Normalize, Denormalize and Reformat to transform the data in the
required format.
. Extensively worked with the Ab Initio Enterprise Meta Environment (EME)
to obtain the initial setup variables and maintaining version control
during the development effort.
. Implemented partition techniques using Partition by Key, Partition by
Expression and Round-Robin techniques on the data unloaded from
multiple tables before sending the data through data quality checks.
. Improved graph performance by eliminating repeated partition/sorts and
unnecessary data landing.
. Implemented Data Parallelism unitizing MFS in the graphs, which deals
with data, divided into segments and operates on each segment
simultaneously through the Ab Initio partition components to segment
data.
. Used phases and checkpoints in AB Initio graphs to facilitate recovery
after failures
. Responsible for deploying Ab Initio graphs as Korn shell scripts.
. Created packages to deploy various .mp, .ksh, .dml, .xfr, and .dbc from
development to test and production environment.
. Involved in developing UNIX Korn Shell wrappers to initialize
variables, run graphs and perform error handling.
. Used Autosys for job scheduling.
Environment: Ab Initio (GDE 1.14, Co>Operating System 2.14), Oracle 9i,
DB2, Sql Server 2000, MS-Access, SQL, PL/SQL, UNIX, Shell scripting,
Windows 2000, Autosys.
Sallie Mae, IN Nov 2006 -
Oct 2007
Ab Initio Developer
The purpose of this project is to provide an IT owned data warehouse for
the Data Analytics team. There was a reporting system in place, but all the
data sources are stored in various systems and pulled together in
inconsistent manner. The goal is to provide a consistent interface for the
reporting and analysis that is required for Sallie Mae. This project will
utilize Ab Initio to normalize the data. Business Objects as the reporting
tool for consistent reports for all users. The backend storage will be
leveraged with the existing EDW environment to avoid extra costs and also
use existing system expertise.
. Analyzed Business and Accounting requirements from the Accounting and
Business Detail level Process design.
. Involved in understanding the Requirements of the end Users/Business
Analysts and Developed Strategies for ETL processes.
. Responsible for the detailed design and documentation. Provided
technical solutions for the Process requests raised by Data team to fix
the issues in the existing system.
. Designed, developed and Unit tested Ab Initio graphs using GDE for
Extraction, Transformation and Loading of data from source to target.
. Extracted data from Teradata Data source tables and created various
lookups like commitment type lookups, and security type lookups.
. Involved in writing test cases to validate the code changes.
. Extensively used Database and Dataset components like Input file, Input
table, and Output table and Transform components like Join, Rollup,
Scan, Filter by expression, Reformat and other components like Merge,
Lookup, Input/output table and Sort.
. Implemented component level, pipeline and data parallelism using Ab
Initio for ETL process.
. Extensively involved in performing EME dependency analysis.
. Involved in Ab Initio upgrade from GDE 1.13 to GDE 1.14.35.
. Used Partition components like partition by expression, partition by
key, etc., to run the middle layer processing parallel.
. Extensively used various inbuilt transform functions like string
substring, string_lpad, string index, lookup functions, date functions,
error functions.
. Extensively used Ab Initio Co>OS commands like m_ls, m_wc, m_dump,
m_copy, m_mkfs, etc. Utilized multi file system (MFS) to execute the
graph parallel.
. Worked on improving performance of Ab Initio graphs by using various Ab
Initio performance techniques like using lookups, in memory joins and
rollups to speed up various Ab Initio graphs. Designed and developed
parameterized generic graphs.
. Closely monitored the Autosys batch jobs in ETL batch run during System,
Integration and Acceptance test runs..
. Extensively worked in the UNIX environment using Shell Scripts. Created
test cases and performed unit testing for the Ab Initio graphs.
Documented Unit testing. Logged and resolved defects in the roll out
phase
. Created a Production support document and documented the Test case work
book, High level Design and Detail Design documents.
Environment: Ab Initio (GDE 1.13, Co>Op 2.13, TERADATA, Teradata SQL
Assistant, BTEQ, UNIX, SQL, TOAD, Windows XP, SQL Plus, Business Objects.
Independent Health, Buffalo,NY
Nov 2004 - Oct 2006
Ab Initio Developer/Data Analyst
Responsible for the design, implementations, maintenance of Customer data
mart, running reports at Independent Health. The project involves pulling
large amounts of customer data, from various disparate data sources,
operational data store (ODS) and OLTP systems having DB2 database source,
performing data cleansing and transformation operations on the data and
finally loading into Oracle database as target again.
Responsibilities:
. Involved in designing the Data model.
. Requirement collection for the ETL processes of the merchant data mart.
. Involved in the high level design of the data mart (Snowflake Schema).
. Analyzed the systems, met with end users and business units in order to
define the requirements.
. Documented user requirements, translated requirements into system
solutions and develop implementation plan and schedule.
. Interpreted logical and physical data models for Business users to
determine common data definitions and establish referential integrity of
the system.
. Extracted, transformed and loaded data from operational data sources into
target Oracle database.
. Created sandbox utility using shell scripts for individual development
and testing environment. Also created multi-file system using shell
scripts to store partitioned data on different nodes (SMP Architecture).
. Responsible for data validation and verification of business
transformation rules.
. Used Ab Initio partitioning and filtering components to achieve parallel
data transformations and filtering.
. Used Partition by Key & Sort, Partition By Expression, Filter By
Expression, Partition By Round Robin, Sort, Reformat, Join, Rollup,
Gather, Redefine Format, Replicate, Lookup File, Generate Records,
Concatenate, Scan, Normalize components to develop the ETL transformation
logic.
. Deployed the Ab Initio graphs as Korn shell scripts and invoked them
using wrapper scripts.
. Did Job Scheduling using Maestro.
. Prepared and implemented data verification and testing methods for the
data warehouse and involved in the unit testing.
. Extensively worked in the performance tuning of the programs, ETL
Procedures and processes.
. Provided technical assistance and support to IT analysts and business
objects users.
Environment: Ab Initio 1.12, DB2, Oracle 9i, Erwin 4.0, Business Objects,
SQL, PL/SQL, UNIX shell scripting, AIX 4.3, Windows 2000/NT
Housing Development Finance Corporation (HDFC), India May 2003 - Oct
2004
ETL Developer
I was a part of the team working on the Housing Development Finance
Corporation (HDFC) project .The project involves pulling large amounts of
merchant data, from various disparate data sources, operational data store
(ODS) and OLTP systems having oracle database source, performing data
cleansing and transformation operations on the data and finally loading
into bankruptcy filing, creditline_decrement, Aggregate Financial,
GL_fiscal, GL_Journal_Entry etc. in Oracle as target .
Responsibilities:
. Extensively worked on Ab Initio GDE Component organizer Designed and
implemented AbInitio Graphs, Sub Graphs using various components such as
dedup sorted, partition by key, reformat, filter by expression, gather,
merge, etc.
. Extensively involved in Ab Initio Graph Design, development and
Performance tuning.
. Developed Data transformation, loading, scrubbing and extraction Programs
using Ab Initio.
. Used inquiry and error functions like is_valid, is_error, is_defined and
string functions like string_substring, string_concat and other string_*
functions in developing Ab Initio graphs to perform data validation and
data cleansing.
. Implemented Data Parallelism through graphs, by using Ab Initio partition
components.
. Developed parameterized generic graphs for data extraction and load.
. Worked on system testing and troubleshooting defects.
. Prepared thorough documentation for the code and test strategies.
Environment: Ab Initio, Oracle 8i, PL/SQL & Visual SourceSafe.
.
ICICI Infotech, India
March 2002 - April 2003
Oracle Developer
Designed and developed the order Processing system which involved the all
the orders from dealers. It starts from dealer order and ends at invoicing.
Responsibilities
. Created several Procedures, Functions, Packages and Database Triggers to
implement the functionality in PL/SQL and Database Administration
. Designed the User Interfaces using Forms.
. Performed coding and testing of the Application
. Interacted with the users during testing phase.
. Involved in Training of application to Users.
. Created Tables and relationship among the tables so as to maintain the
Referential Integrity.
. Worked with Table constraints, Views and Sequences.
. Wrote Stored Procedures, Database triggers.
. Created Menus and indexes.
. Prepared User Manual.
. Involved in training of application to Users.
Environment: Windows 98, Oracle 7.1, Forms 3.0, SQL * PLUS 8.0, PL/SQL,
Developer 2000