
Data Quality

Location:
Atlanta, GA, 30328
Posted:
April 12, 2010


Resume:

Professional Summary

Data Warehousing Developer with experience in analysis, estimation,
architecture and design, and proficiency in the administration,
development, build, deployment and support of Data Warehouse, Data
Migration, Data Quality and Master Data Management projects in
service-oriented architectures. These projects require deep business
knowledge, process optimization and business operations knowledge to
achieve significant and measurable business value.

Technical Skills

Databases: Oracle, SQL Server, Mainframe DB2, DB2-UDB

Data Integration: Informatica PowerCenter, Informatica PowerExchange,
DTS Packages

Data Quality: Informatica Data Explorer & Data Quality, Dataflux (SAS)

Operating Systems: Unix, Windows, Linux, Mainframe

Business Process Development: C, Pro*C, Java, Unix Shell Scripting,
Windows scripting

Enterprise Application Integration: Soap, XML, WSDL, Webservices, Java
Messaging Services, Biztalk, SeeBeyond, Control M*

Others: HP Quality Center, Mercury Time Management, SFTP, Storage Area
Network (SAN), Network Attached Storage (NAS), PVCS, VSS, Windows
Cluster*

Certifications & Trainings

* Informatica Certified Developer

* Informatica Certified Data Quality Professional

* Informatica Certified Specialist

    * Data Explorer (IDE)

    * Data Quality (IDQ)

* Dataflux Trained Data Quality Professional

* Oracle Certified Associate*

Experience Summary

* Performed ETL development in highly integrated environments using
  Informatica PowerCenter, PowerExchange and plug-ins, with expertise in
  handling UTF-8 and other special characters

* Implemented, deployed and supported synchronization of operational and
  transactional systems with high-quality data in batch, near-real-time
  and real-time modes, with quick turnaround times, while maintaining
  technical and functional documentation and coding standards

* Developed various proofs of concept and presented architectural and
  cost-effective changes to technical and business users for implementing
  new solutions on existing and/or new hardware and software, to get the
  best out of the resources and budget available in the client's
  environment

* Performed ETL administration for Development, QA, Staging and
  Production environments (with High Availability, Load Balancing and
  Failover), including but not limited to build management, performance
  and standards management, and installation, configuration and upgrade
  of the Repository, Repository Servers and PowerCenter Servers

* Participated in the design and estimation of a Hygiene and Match system
  to process customer data from 232 countries, and developed Architect
  and Monitoring jobs to parse contact names and cleanse business names
  and personal information such as e-mail and phone, which required
  customization of the Dataflux QKB (Quality Knowledge Base)

* Initiated the idea of exposing Dataflux (SAS) Web Services to build a
  self-sufficient tool for Data Stewards, helping them decide on Data
  Quality and Matching Matrices on a web portal, and integrated it with
  Informatica in transactional and batch modes

* Participated actively in profiling project source data to confirm the
  consistency of the target model with legacy systems (Data Modeling) and
  to provide a detailed report covering, among other things, the
  accuracy, cleanliness and redundancy of the data to be loaded into the
  target Data Warehouse

Employment History

* Technical Specialist at Wipro, 2007 to date

* Project Engineer at Wipro, 2006-2007

Awards and Recognitions

* Client-appreciated STAR award(s)

    * Informatica upgrade from 7.x to 8.x

    * Support for various applications

* Wipro-awarded FIMC award(s)

    * Developing and maintaining critical applications

    * Excellent contribution to the GHM project, which helped achieve a
      PCSAT rating of 7/7

* Rated a stupendous performer in the last 2 annual appraisal cycles

Professional Experience

Customer Management Application

AGL Resources Ltd

A leading natural gas company with more than 2.3 million customers needs a
cost-effective customer data warehouse to house all customer, product,
billing, service and sales information for its natural gas customers, so
that it can integrate and analyze customer and market trend information
effectively. Customer center executives place orders on behalf of
customers on the web portal. Biztalk picks up each order and sends
messages to Automatic Dispatch (AD) and to the Customer Information System
(CIS), where most of the business logic is implemented. The results are
replicated back, with historic data maintained, to the CMA application, an
easy-to-use interactive web portal.

Environment: Windows, Mainframes, Informatica PowerCenter 8.x (for ETL),
Informatica PowerExchange 8.x (for Real Time Change Data Capture),
Mainframe DB2 (for Legacy DB), Microsoft SQL Server 2005 (for Staging and
Data Warehouse), Biztalk (for Orchestration), Transidiom (as Web services),
SAN, Windows Clusters, MSMQ.

Responsibilities & Contributions:

* Managed the PowerCenter Domain, Nodes, Service Manager and Application
  Services, configuring High Availability, Grid and related features to
  support disaster recovery and failover strategies for the Data
  Integration environment

* Developed the repository object structure, managed user and user-group
  access to repository objects, and contributed to technical and system
  architecture planning by testing and implementing new cost-effective
  technical solutions that deliver higher performance with available
  resources

* Implemented alerts, monitoring and exception handling by introducing
  Informatica Web Services and e-mail services, and trained technical end
  users to trigger and stop workflows from an easy-to-use portal

* Published coding standards, build management and deployment documents
  for code movement and for repository and domain backup/restore, and
  enforced them across all ETL applications to streamline processes

* Developed and maintained quick automated solutions, using Windows
  scripting and Informatica, to check for data inconsistencies across
  environments and correct them as required

* Expanded the scope of the Data Warehouse by migrating more
  source-of-truth data from the Customer Information Management System
  (Mainframe) to Windows using PowerExchange

Global Hygiene and Match

Dun & Bradstreet

An industry-leading software giant receiving information from 232
different countries needs a centralized data warehouse to cleanse the
inbound information and perform name and address matching for unique data
identification. Information is passed to the vendors in batches (a
compressed tar file containing a binary image file of each table). Each
batch carries a priority and a performance SLA within which it has to be
completed and delivered back. All this information is accumulated in the
data warehouse after hygiene rules are applied. The data warehouse is
still being seeded on a country-by-country basis and is expected to grow
to 800 million records of master data.
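
The batch handling described above (each batch carrying a priority and an SLA) could be ordered roughly as in the following Python sketch. It is illustrative only and is not the project's implementation: the `Batch` fields, the rule "lower priority number first, then earliest SLA deadline", and the country-code batch names are all assumptions made for the example.

```python
import heapq
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass(order=True)
class Batch:
    # Comparison uses (priority, due); lower priority number = more urgent
    # (an assumed convention, not stated in the source).
    priority: int
    due: datetime                      # receipt time plus the SLA window
    name: str = field(compare=False)   # hypothetical batch label


def schedule(batches):
    """Return batch names in processing order: priority, then SLA deadline."""
    heap = list(batches)
    heapq.heapify(heap)
    return [heapq.heappop(heap).name for _ in range(len(heap))]


if __name__ == "__main__":
    received = datetime(2010, 1, 1)
    queue = [
        Batch(2, received + timedelta(hours=4), "FR"),
        Batch(1, received + timedelta(hours=8), "US"),
        Batch(1, received + timedelta(hours=2), "IN"),
    ]
    print(schedule(queue))
```

A heap keeps the next batch to process cheap to retrieve as new batches arrive, which is one reasonable choice when priorities and deadlines both matter.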

Environment: HP-UX, Informatica PowerCenter (for ETL), Informatica Data
Explorer (Profiling and Modeling), SeeBeyond (for EAI), Dataflux
(Cleansing and Matching), First Logic (Non-US Address Cleansing), Quality
(US Address Cleansing), Oracle 10g (Data Warehouse), DB2 UDB (Inbound and
Outbound Stage), Java & PL/SQL (for extracting compressed tar file), Bulk
Data Exchange FTP (for interchanging the batches between vendors and the
customer), NAS, Control-M and JMS Queues.

Responsibilities & Contributions:

* Involved in estimating and planning architectures and infrastructure in
  support of data management processes and procedures, and in procurement
  of the design document, while maintaining logical/physical data models

* Involved in designing and developing various Hygiene and Match rules
  for business and contact names, addresses and personal fields such as
  e-mail, phone and web address, which included, among other things,
  understanding the customer's requirements and presenting
  profiling/audit results in summary and in detail

* Used the Informatica Data Integration platform to extract, transform
  and load data, developing and reviewing mapping designs, workflows and
  load processes while ensuring adherence to coding standards and
  documentation

* Led development and maintenance of Dataflux Architect, Profiling and
  Monitoring jobs, and took complete ownership of customizing the Quality
  Knowledge Base, which included upgrading and modifying Scheme, Regex
  and Vocab files across various locales to accommodate multiple
  customers by using Macro Variables

* Reported and automated exception handling and other system-critical
  procedures by writing code in Pro*C, Java, XML, UNIX shell scripts and
  the Expression node/transformation of Dataflux/Informatica PowerCenter

* Exposed Dataflux Web Services to demonstrate cleansing, parsing and
  matching of various business components, and integrated these services
  with Informatica PowerCenter in real-time and batch modes
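
As a rough illustration of the kind of hygiene and match rules listed above (e-mail, phone and business name), here is a minimal Python sketch. It does not reproduce Dataflux or its Quality Knowledge Base: the regular expression, the 7 to 15 digit phone rule and the `LEGAL_SUFFIXES` vocabulary are assumptions chosen for the example.

```python
import re

# Simplified e-mail pattern; real hygiene rules are locale-aware and stricter.
EMAIL_RE = re.compile(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}$")

# Illustrative stand-in for a vocabulary file, not actual QKB content.
LEGAL_SUFFIXES = {"inc", "ltd", "llc", "corp", "co"}


def clean_email(raw):
    """Trim and lower-case an e-mail address; return None if it looks invalid."""
    value = raw.strip().lower()
    return value if EMAIL_RE.match(value) else None


def clean_phone(raw):
    """Keep digits only; return None if the length is implausible (assumed 7-15)."""
    digits = re.sub(r"\D", "", raw)
    return digits if 7 <= len(digits) <= 15 else None


def match_key(business_name):
    """Build a simple match key: lower-case, strip punctuation and legal suffixes."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", business_name.lower()).split()
    return " ".join(t for t in tokens if t not in LEGAL_SUFFIXES)


if __name__ == "__main__":
    print(clean_email("  John.Doe@Example.COM "))
    print(clean_phone("+1 (404) 555-0100"))
    print(match_key("Acme, Inc.") == match_key("ACME Ltd"))
```

Records whose match keys are equal become candidates for the same master record; a production matcher would add fuzzy comparison and locale-specific rules on top of this.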

Educational Background

* B.E. in Electronics and Communication Engineering from the University
  of Rajasthan, India

Contact Details

Current Location: Atlanta, Georgia

Mobile: 484-***-****

Email: *************@*****.***

Public Profile: http://www.linkedin.com/in/gurpreetserpikhi


