Priyanka Dontula
[pic]
[pic]
Professional Summary
. Around 8+ years of IT experience in the System Analysis, Requirement
gathering, Design, Development, Implementation, Testing and Production
support of Database in Data warehousing performing Data Analysis, Data
Modeling, Data Extraction, Data Transformation, Data Loading.
. Experience in ETL & Business Intelligence using IBM Infosphere
Datastage V8.5, Ascential Datastage 7.5(DataStage Manager, DataStage
Designer, DataStage Director, Parallel Extender), creating Fact
Tables, Dimension Tables using Star Schema Modeling.
. Used IBM Infosphere Information Server Manager for deployment in
different environments like Development,Testing and Production and to
mange the versions of datastage jobs.
. Extensively made use of stages like Sequential files, Transformer,
Aggregator, Sort, Join, Change Capture,Lookup in Datastage Designer.
. Used Datastage Director to debug, validate, schedule, run and monitor
Datastage jobs.
. Used Datastage Administrator for creating environment variables.
. Developed ETL Parallel jobs using IBM Infosphere DataStage V8.5 to
extract data from source system and landed the data into a staging
area and thereby into Data mart and have a good knowledge on Datastage
architecture.
. Excellent experience in Relational database (RDBMS), Oracle, SQL
Server, DB2 and MS Access.
. Played an integral part in the building of a multi-server, multi-
database enterprise Data warehouse using DataStage ETL (extact,
transform and load) tools and SQL Server to load legacy business data.
. Designed logical and physical database configurations for large-scale
data warehouse and OLTP implementations using Erwin.
. Experienced in creating High-Level and Low-Level Design Documents as
per the project requirement document and creating the mapping document
for all the tables in the project.
. Experienced in Quality Assurance creating for Data Warehousing
projects creating Test Plans, Test Objectives, Test Strategies, and
Test Cases. Ensuring the Data in data warehouse meets the business
requirements.
. Strong understanding of Data Warehousing concepts Experience.
Expertise in KIMBALL methodology.
. Adept in all stages of SDLC (Software Development Life Cycle).
. Capable of working under high stress environment with resource
constraints.
. Excellent analytical, communication, and facilitation skills with the
ability to gain consensus across multiple teams.
Technical Summary
. ETL Tools : IBM Infosphere Datastage and Quality Stage 8.5, Ascential
DataStage Enterprise and 7.x Editions, Informatica
. ETL Deployment Tool: IBM Information Server Manager.
. Operating Systems : UNIX, Windows 95/NT/2000 and MS-DOS.
. Languages : Korn Shell Programming, HTML.
. RDBMS : SQL, SQLServer2005/2008, Oracle10g/9i/8i
. Web Server : IBM Websphere.
. Reporting Tools: Cognos 7.x/8, Business Objects,OBIEE
Certifications:-
IBM Websphere for Datastage 8.0 Certified.
Training Programs:-
Informatica 7.x training for datawarehousing, Cognos 8.0 reporting tool
training program.
Memberships:-
Member of TDWI (The Data Warehousing Intitute). It is a community of
learning where datawarehousing professionals come together to gain
knowledge and skills, network with peers, and advance their careers.
Professional Experience
Central Hudson Gas & Electric
Dec '12 - Mar '13
Poughkeepsie,NY
Datastage Lead Developer
CH Energy Group offers electricity, natural gas, propane, fuel oil and
other petroleum products, along with superior energy services for our
residential and business customers. The IEA project is to develop a
reporting environment that can generate monthly summary reports on electric
activities in Mega Watts in an accurate and timely way to enable electric
business leadership.
Environment: IBM Infosphere Datastage and Quality Stage 8.5, Aginity
Netezza 10, DB2, SQL Server, UNIX, WINSCP, Putty
. Involved in gathering Business requirements from the users and
documenting in a High level design document and implemented the ETL
Datastage jobs.
. Developed ETL Parallel jobs using IBM Information DataStage and
Quality Stage Designer V8.5 to extract data from different source
systems like mainframe data and transforming the data as per the
requirement and loading into the Datamart.
. Designed and developed jobs using DataStage Designer as per the
mapping specifications using appropriate stages.
. Created source to target mapping and job design documents from
staging area to ODS.
. Developed Datastage jobs for Initial and Delta Load in fact table.
. Replaced Change captured stage with Join stage when there is no need
to capture column changes and replaced lookup stage with join stage
when dealing with large amount of data.
. Designed Data Stage Sequence jobs to specify Job execution order and
for mail notification and for terminate request if any of the
Datastage job fails.
. Used DataStage Director and its run-time engine to schedule the
jobs, testing and debugging its components, and monitoring the
resulting executable versions (on an ad hoc or scheduled basis).
. Parameterized the Jobs to have the ability to move the jobs from
Development to Production environment with minimum modification
between the environments.
. Meeting with business users for any changes requests or change advices
in the project report.
. Participated in weekly status meetings, and conducting internal
and external reviews as well as formal walkthroughs among various
teams, and documenting the proceedings
New York City Department Of Education
Oct '11 - Nov '12
Brooklyn,NY
Datastage Lead Developer
NYCDOE manages the city's public school system divided in 5 Boroughs having
around 1700 schools with 1.1 Milliion Student.DataCore ILearn is an
application that allows students to register courses online through
different vendors. Different data streams from various vendors need to be
processed by datastage using Quality Stage for data profiling.
Environment: IBM Infosphere Datastage and Quality Stage 8.5, IBM Infosphere
Information Server Manager, Oracle SQL Developer, SQL Server Management
Studio V9, DB2, VSS, UNIX,WINSCP
. Involved with Business Analysts to understand the business requirement
specifications (High level design document) then translate them into
technical documents (Low level design document) and implemented the
ETL Datastage jobs.
. Developed ETL Parallel jobs using IBM Information DataStage and
Quality Stage Designer V8.5 to extract data from different source
systems and transforming the data as per the requirement and
loading into the Datamart.
. Designed and developed jobs using DataStage Designer as per the
mapping specifications using appropriate stages.
. Used MQ Plug-ins to get read and process the MQ Messages.
. Created source to target mapping and job design documents from
staging area to ODS.
. Developed Datastage jobs for handling the daily syncs/incremental data
based on last updated date to implement SCD1 and SCD2 in target data ware
house.
. Worked on Datastage jobs performance tuning by using various performance
techniques.
. Replaced Change captured stage with Join stage when there is no need
to capture column changes and replaced lookup stage with join stage
when dealing with large amount of data.
. Designed Data Stage Sequence jobs to specify Job execution order and
for mail notification and for terminate request if any of the
Datastage job fails.
. Deployed/developed the solutions that maximize the consistency and
usability of data using Datastage
. Used DataStage Director and its run-time engine to schedule running
the solution, testing and debugging its components, and monitoring
the resulting executable versions (on an ad hoc or scheduled
basis).
. Parameterized the Jobs to have the ability to move the jobs from
Development to Production environment with minimum modification
between the environments.
. Used IBM Infosphere Information Server Manager to handle jobs versions
in all the environments like Development, Integration, QA and
Production.
. Participated in weekly status meetings, and conducting internal
and external reviews as well as formal walkthroughs among various
teams, and documenting the proceedings
United Health Group
Apr '10 - Sep '11
Hartford CT
Datastage Lead Developer
UnitedHealth Group is a major health insurance provider in USA. Global
Solutions is a UnitedHealth International product to provide quality and
innovative solutions that help expatriates, inpatriates, third country
nationals and key local nationals lead healthier lives anywhere, anytime.
From Datastage perspective, since there is no current process to identify
the Global solutions indicator certain key fields needed to be added to the
current claim feed to PE to identify the Global Solutions indicator and
send the info to the PE on the provider feed for foreign provider claims.
Certain changes were to be made to the existing jobs and the DB2 tables to
accomodate the new field(Global Solutions Indicator).
Environment: IBM Ascential Datastage 7.5, Autosys, Oracle10g, DB2, Toad,
Shell Scripts, UNIX
Responsibilities:
. Worked on claims to Extract and Transform and load the data in to Data
Integration Hub using Datastage.
. Developed the Datastage jobs for handling the daily syncs/incremental
data based on last updated date for implement SCD1 and SCD2 in target
data ware house.
. Involved with business analysts to understand the business requirement
specifications (High level design document) then translate them in to
technical documents (Low level design document) and implemented the
ETL Datastage jobs.
. Worked on Datastage jobs performance tuning by using various performance
techniques.
. Developed ETL Parallel jobs using IBM Information Server DataStage
V7.5 to extract claim information from source system on DB2
database and landed the data into a staging area.
. Replaced Change captured stage with Join stage when there is no need
of capture all column changes and replaced lookup stage with join
stage when deal with large amount of data
. Developed jobs in IBM Web Sphere Parallel Extender PX using
different stages like Transformer, Aggregator, Lookup, Join, Merge,
Modify, Remove Duplicate, Sort, Peek, Change capture, Filter, Copy,
Sequential File, and Data Set.
. Designed ETL jobs to identify and remove duplicate rows using remove
Duplicate stage and using all other
stages(Xfrm,Join,Sort,Lookup,basically we can use every stage for getting
unique rows)
. Designed Data Stage Sequence jobs to specify Job execution order and
for mail notification and for terminate request if any of the
Datastage job fails.
. Deployed/developed the solutions that maximize the consistency and
usability of data using Datastage
. Extensively used Datastage Director for monitoring and debugging of
jobs and sequences. And cleared the orphan process.
. Extensively used Autosys Scheduler Tool to schedule Data Stage jobs
and created autosys jobs and Boxes
. Parameterized the Jobs to have the ability to move the jobs from
Development to Production environment with minimum modification
between the environments.
. Unit Tested Data Stage Jobs in development once development is done by
using negative and positive test cases for make sure that Datastage code
is working according to business demands
. Signed off System Integration Testing and User Acceptance Testing and
supported the issues when business is testing the code.
GMAC INSURANCE
Feb '09 - Mar '10
Greensboro, NC
Senior Datastage Developer
GMAC is a vehicle insurance company located in USA and UK. The Interfaces
that GMAC sends are either inbound or outbound to an application called
iWarranty. GMAC sends the source files in .CSV or .txt format to iWarranty
application which are used by Datastage Component for further processing.
All the inbound target files are kept in Tumbleweed Server (a batch
process) from where they are pulled by target system. For Outbound
interfaces source will be a vendor to GMAC and iWarranty will the target.
Environment: IBM Web Sphere DataStage 7.5, Oracle 9i, PL/SQL, SQL Server
2008, Toad, UNIX, Windows Server 2003.
Responsibilities:
. Worked with Business Analyst to identify, develop business
requirements, transform it into technical requirements and
responsible for deliverables.
. Used Datastage Designer to develop jobs for Extracting, Cleansing,
Transforming and Loading data into data warehouse.
. Developed several Server and Parallel jobs to improve performance by
reducing runtime using different partitioning techniques.
. Set standards for developing new Datastage Jobs, deployment and
fixing production issues.
. Actively involved in Dimensional modeling for identifying the
measures, dimension tables, fact table and aggregator tables.
. Designed jobs using DataStage Designer to retrieve the data from
staging area and load the transformed data into data mart on SQL
Server 2008, to create several downstream feeds and load the cleansed
data into Data mart and Repository
. Created Shared Containers to simplify the Data Stage design, to use as
a common job component throughout the project.
. Extensively used Data stage Designer, Administrator and Director for
creating and validating jobs.
. Used Transformer stage for data conversion, for handling the data
mismatch and for generating the surrogate keys to improve the
performance.
. Created the Project documentation, which involved the description
of the project as a whole and for each job. Short description of
each stage, tables, and routines used.
. Created Error Files and Log Tables containing data with
discrepancies to analyze and re-process the data.
. Involved in Performance Fine Tuning of ETL programs. Tuned DataStage
and Stored procedures Code.
. Developed the automated and scheduled load processes using UNIX shell
scripting & Autosys.
. Developed UNIX scripts based on the requirement like checking the
file availability, format checking and triggering the datastage job
from the scripts.
. Archiving the generated target files, error files and audit files.
. Made changes based on CA and CR's for the developed Interfaces.
Michelin
Mar '08 - Jan '09
Greenville SC
Senior Datastage Developer
Michelin relies on its global industrial presence backed by a sales
network in 170 countries. In order to establish the best possible
strategy to meet - and anticipate - its customers' needs, the
Michelin Group is organized into product lines, each one dedicated
to an area of activity, with its own marketing, development,
production and marketing resources. This project aims at developing
an interface where it can estimate the request order for customers
and deliver it accordingly.
Environment: DataStage7.5, Oracle9i, Toad,SQL, Erwin4.5.2 and Windows
Server 2003.
Responsibilities:
. Worked with Business Analyst to identify, develop business
requirements, transform it into technical requirements and responsible
for deliverables.
. Involved in the analysis of Physical Data Model for ETL Source to
Target mapping and the process flow diagrams for all the business
functions.
. Built Data Warehouse from scratch with Star schema model.
. Used the Data stage Administrator to assign Privileges to users or
user groups (to control which Data Stage client applications or jobs
they see or run), move, rename, or delete projects, and manage publish
jobs from development to production status.
. Used the Data stage Designer to develop processes for extracting,
cleansing, transforming, integrating, and loading data into data
warehouse.
. Designing and Developing PL/SQL Procedures, functions, and packages to
create Summary tables.
. Implemented Log-based change data capture (CDC) for delta loading.
. Designed guide lines for Source based CDC & Target based CDC
. Defined Stage variables for data validations and data filtering
process.
. Tuned Datastage jobs to obtain better performance.
. Performed User Acceptance Testing and System Integration Testing.
Schneider Oct'07 - Feb '08
Chicago IL
Senior Informatica Developer
Schneider Electric delivers total infrastructure solutions for healthcare
facilities that make their energy more efficient, reliable, productive,
safer, and greener. This project aims at developing mappings and modifying
existing mappings in the informatica based application INFOR. This
application gets files from SMS system, transforms them and generates files
in the format required by SMS system.
Environment: Informatica 7.1.1, Oracle 9i, Toad
Responsibilities:
. Analyzing business processes, functions, existing transactional
database schemas and designing star schema models to support the users
reporting needs and requirements.
. Designed and developed the Mappings using various transformations to
suit the business user requirements and business rules to load data
from Oracle, SQL Server, Sybase, DB2, flat file and XML file sources
targeting the views (views on the target tables) in the target
database (Oracle).
. Prepared ETL Specifications and design documents to help develop
mappings.
. Implemented standards for naming conventions, Mapping Documents,
Technical documents and Migration forms.
. Developed ETL Mappings, Sessions, Workflows using various
transformations like Normalizer, Update Transformation, Lookups,
Filters, Routers and Joiners with Excel, flat file and relational data
structures.
. Worked with Memory cache for static and dynamic cache for the better
throughput of sessions containing Rank, Lookup, Joiner, Sorter and
Aggregator transformations.
. Created mappings for Historical and Incremental Loads.
. Used Oracle 10g, SQL, PL/SQL, Procedures, Functions and Complicated
SQL queries using TOAD, SQL developer to truncate the data in the
target tables before loading and tune the ETL Mappings to achieve the
required Performance timing.
. Created tables, views and indexes and worked on data integrity and
created constraints. Also created some procedures that picks-up
relevant data and populate the tables.
. Debugging code, testing and Validated data after processes are run in
development according to business Rules.
. Performance tuning of Mapping and Workflow objects. Also performed DB
partitioning for better performance.
. Supporting daily loads and work with business users to handle rejected
data.
. Prepared and maintained mapping specification documentation.
. Developed Logical Data Flow Diagrams in Visio for every mapping that
is developed.
. Assisted QA Team to fix and find solutions for the production issues.
. Translated the business processes into Informatica mappings for
building the data mart.
. Worked on SQL tools like TOAD to run SQL Queries to validate the
data.
. Created, updated and maintained ETL technical documentation.
. Created Unit Test cases document and reviewed the test case
documents for the interface.
Uniprise Inc. Nov '05 - Sep '07
Hyderabad, India
Senior Software Engineer
'Uniprise Provider Executive Summary Scorecard' aims to develop Operational
Reports and Dashboards for their Provider Call/Contact centre operations.
For this initiative the offshore team requires to build a scalable database
using datastage which will help to summarize and store various dimensions
which then may be used to present the dashboards/linked reports for
management information and analysis.
Environment: Datastage 7.5/EE (Parallel Extender), Oracle 10g, TOAD, DB2-
UDB, Sql server 2005
Responsibilities:
. Developed Datastage server and parallel jobs to extract, transform and
load data into data Warehouse from various sources like relational
databases (DB2), Oracle 10g etc.
. Developed Parallel jobs using Stages such as Aggregator, Compare,
Funnel, Join/Merge, Lookup, Sort, Filter and Transformer.
. Created source table definitions in the Datastage Repository by
studying the data sources.
. Used Datastage Manager for importing metadata from repository, new job
categories and creating new data elements.
. Used the Datastage Designer to develop processes for extracting,
cleansing, transforms, integrating and loading data into data
warehouse database.
. Generations of Surrogate IDs for the dimensions in the fact table for
indexed, faster data access.
. Implemented shared containers to use in multiple jobs, which have same
business logic.
. Created Error Files and Log Tables containing data with discrepancies
to analyze and re-process the data.
. Created job sequences for feeds, schedules to automate the ETL
process.
. Implemented Batch Logic to run the jobs in sequence from .CSV file.
. Implemented Timeline process for eligible members integrating to
Survey data mart. .
. Involved in Developing PL/SQL scripts to validate and load data into
interface tables.
. Used the Datastage Director to monitor the jobs by looking at the Log
files.
. Importing and exporting developed feeds within development, test and
production environments using Datastage Manager.
. Optimized job performance by carrying out Performance Tuning Methods.
Galaxy Software Systems Ltd. Jan '05 - Oct '05
Hyderabad, India
Database Engineer
Worked as Database Engineer in Data Management System (DMS) Project in
Galaxy Software Systems Ltd, using Oracle database as back end. The project
was developed incorporation the principles of the three-tier client-server
model of backend database, application server (Web Server) and the browser
on the client machine. The system is broad based and covers various sectors
of operations.
Environment: Oracle 8i, UNIX (Solaris 2.3), Tomcat 3.0.
Responsibilities:
. Gathered user requirements and followed up by analysis and design.
. Realized use cases by designing class diagrams, sequence diagrams
using UML & Rational Rose.
. Involved with database design including tables and fields using
normalization principles and Entity relationship diagrams.
. Designed and implemented backend Oracle PL/SQL stored procedures and
triggers.
. Extensively worked in the performance tuning of the database.
. Used SQL tools like TOAD to run SQL queries and validate the data.
Education
. Bachelor's degree in Computer Engineering, J.N.T.University, India.