Resume

Sign in

ETL Informatica Talend Consultant

Location:
Jersey City, New Jersey, United States
Posted:
November 14, 2017

Contact this candidate

Himanshu Dixit

Status: H*B valid till Dec **** (I-140 approved)

Synopsis

Dynamic IT professional with around 10 years of extensive working experience in Analysis, Design, Development, Production Support, Enhancement and Implementation of Data Warehouse projects using Informatica PC, Talend DI & BD, Oracle, SQL Server, Netezza, MySQL, DB2 and UNIX.

Experience in roles as - ETL Architect / Technical Lead / ETL Specialist / ETL Lead / Sr. Informatica Developer.

Strong working experience in Investment, Media, Insurance/Annuity domain.

Participated in all phases of SDLC throughout all architectural tiers of the system, experience in adhering software methodologies like Agile, Waterfall.

Very strong experience in Informatica PowerCenter suite which includes Mapping Developer, Workflow Manager, Workflow Monitor, Admin Console and Repository Manager.

Good experience with Big Data and Hadoop ecosystem, Hadoop Distributed File System (HDFS), supported databases like Hive and Pig, installation of Hortonworks.

Strong experience in Talend DI and BD installations and its components like tMap, TJavaRow, tAggregateRow, tFilterRow, tFilterColumns, tJoin, tHDFSGet, tHDFSPut etc.

Certified in Informatica ILM / TDM. Strong experience in Informatica ILM / TDM tool for Data Masking, Data Discovery (sensitive data).

Strong experience in Informatica Metadata Manager, Data Analyst and Informatica Power Exchange.

Expert in design and development of Data Warehouse, Marts and ODS.

Experience in designing the ETL architecture, Load Strategy and Data Flow.

Strong understanding of Data Modeling (Star Schemas and Snow Flake Schemas), Conceptual / Logical / Physical models, DWH concepts, ER diagrams, Data Flow Diagrams / Process Diagrams.

Strong experience in databases like Oracle, DB2, SQL Server and Netezza and database tools like TOAD, AQT, Win-SQL, SQL Developer and Aqua Data Studio.

Extensive experience in Performance Tuning, optimizing ETL process and coming up with the best design to obtain superior performance. Have also worked on creating UNIX shell scripts to automate the ETL process.

Good understanding of PL/SQL Packages, Procedures, Functions, Triggers and other database components. Experience in converting PL/SQL procedures to ETL Interfaces.

Experienced in using transformations like Expression, Filter, Router, Rank, Lookup along with complex transformations like Normalizer, Aggregator, HTTP, Dynamic Lookup, Unconnected lookup, Persistent Cache, SQL etc.

Experience with different types of Sources & Targets – Flat files, Relational tables, VSAM files, XML files etc.

Experience in communication with Informatica Support regarding issues/bugs related to various Informatica tools like PowerCenter, TDM etc.

Good Experience in creation of quality related documents (Function specification, ETL architecture design, High Level and Low-Level design, and Unit Test Case documents).

Excellent analytical skills in understanding client’s organizational structure.

Poses good problem solving and analytical skills. A highly motivated professional, team player and have the ability to effectively work, communicate and mentor people. Have handled team size as big as 10-12 members.

Fast Learner, excellent communication and presentation skills, ability to quickly adapt to the technical environment coupled with very positive user interaction & team spirit.

Educational Qualifications

Bachelor of Engineering, India (2007)

Awards and certifications

Valuable Contribution Award (VCA) – Patni Computers (Now Capgemini)

Certified in Informatica ILM / TDM

Technical Skills

BI & ETL Tools

Informatica PowerCenter 9.x/8.x/7.x, Talend for Big Data, Informatica ILM / TDM 9x, Informatica Metadata Manager 10.1.0, Informatica Data Analyst, Informatica Power Exchange

Databases

Oracle 11g/10g/9i/8i, IBM DB2, SQL Server, Netezza, MySQL,

Big Data – Hive, Pig

Operating System

Windows 7/XP/NT/2000/98/95, Unix

Scripting Languages

UNIX Shell Scripting, SQL, PL/SQL

Job Scheduler

iXp AutoSys, Tivoli Maestro, GECS

Others

Win SQL, TOAD, SQL DBX, AQT, Ipswich, WinSCP, HP Quality Center, MS Office, MS Visio, edit plus, Text pad, Oxygen XML Editor, BMC Remedy Management tool and version controlling tools like Smart SVN, VSS and Calligo.

Domains

Investment, Media, Annuity & Life Insurance

Professional Experience

Project # 1

POC – Talend for Big Data

Client

Time Warner, NYC

Employer

AIT Global Inc

Duration

June 2017 – Till Date

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

Time Warner is a Media and Entertainment company with major divisions as HBO, Warner Brothers and Turner. TW Corporate receives and processes Human Resource data from all its divisions and external vendor, Fidelity.

Responsibilities:

Installed Talend including Talend Administration Center and Talend Integration Cloud Remote Engine.

Created projects and users on Talend Administration Center.

Analyzed the existing DWH processes based on their complexity design and built similar process into Talend ETL platform.

Created mapping documents based on the business requirements to be used by developers and QA team.

Conducted code reviews with the team and suggested code enhancements for better performance.

Worked with testing team to finalize the test schedule and test cases.

Interacted with Hortonworks to resolve Hive and Pig connectivity issues.

Co-ordination with BSA, Business users, On-shore ETL Developers, Offshore ETL Developers, Testing Team, Informatica Admin Team, DBAs etc.

Involvement in Unit Testing of the mappings including integration testing. Using the best document format and best practice processes for its successful execution.

Extensively involved in writing SQL queries (sub queries and join conditions) for building and testing ETL processes.

Created Function specification, High-level design document (ETL design architecture) and Low-level design document and Unit Test Case, deployment and operational document.

Environment: Talend for Big Data 6x, Hortonworks 1.3, Oracle 11g, Hive, Pig

Project # 2

Corporate HRDC Batch Rewrite

Client

Time Warner, NYC

Employer

AIT Global Inc

Duration

June 2016 – June 2017

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

Time Warner is a Media and Entertainment company with major divisions as HBO, Warner Brothers and Turner. TW Corporate receives and processes Human Resource data from all its divisions and external vendor, Fidelity. The primary objective of Corporate HRDC Batch Rewrite project is to convert all the legacy Data Integration programs in Micro Focus Cobol into standard technology for data integration, Informatica, using the best practices of Data Warehousing.

Responsibilities:

Understand the data flow of the current scripts.

Development of Informatica mappings, workflows and database objects wherever needed.

Co-ordination with BSA, Business users, On-shore ETL Developers, Offshore ETL Developers, Testing Team, Informatica Admin Team, DBAs etc.

Involvement in Unit Testing of the mappings including integration testing. Using the best document format and best practice processes for its successful execution.

Extensively involved in writing SQL queries (sub queries and join conditions) for building and testing ETL processes.

Scheduling of ETL workflows through GECS job scheduler.

Created Function specification, High-level design document (ETL design architecture) and Low-level design document and Unit Test Case, deployment and operational document.

Environment: Informatica PowerCenter (v9.6/9.7), Oracle 11g, SQL

Project # 3

POC - Corporate Data Catalog and Business Glossary

Client

Time Warner, NYC

Employer

AIT Global Inc

Duration

June 2016 – Dec 2016

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

Objective of this POC is to create Data Catalog for Time Warner corporate using Informatica Metadata Manager, PowerCenter and Data Analyst.

Responsibilities:

Creation of difference resources in Informatica Metadata Manager for Oracle, SQL Server, Netezza, PowerCenter, Business Objects metadata.

Linking of various resources and running end-to-end lineage views for different attributes.

Interaction with Informatica support regarding the issues with the metadata Manager.

Co-ordination with Business users, Informatica Admin Team, DBA.

Created various design documents and understanding documents for business partners.

Environment: Informatica Metadata Manager 10.1, Informatica data Analyst, Informatica PowerCenter (v9.6/9.7), Oracle 11g, SQL Server, Netezza.

Project # 4

Corporate Data Masking

Client

Time Warner, NYC

Employer

AIT Global Inc

Duration

June 2015 – May 2016

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

In the past several months, there have been several security breaches related to the unauthorized exposure of restricted data. Breaches of masked data are exempt from reporting and penalties.

Time Warner is the central owner of enterprise HR data that includes SSN, Salary, Personal Health Information, and other regulated PII. TW owns the data for domestic and international employees. The applications have personal identifiable information, which creates a potential risk of sensitive data being exposed to larger groups (consultants, offshore resources from vendor partners) during production transfers to non-production environments.

Data masking is a method of creating a structurally similar but inauthentic version of an organization's data that can be used for purposes such as development, software testing and user training. The purpose is to protect the actual data while having a functional substitute for occasions when the real data is not required. The project builds and establishes a solution wherein the data would be de-sensitized, de-identified and scrubbed during production copy to non-production environment.

Project Artifacts:

1. 1500 + tables mask for each application

2. 5000+ attributes are masked in each application

3. ETL – Informatica, Data Masking – TDM

4. Databases – Oracle, SQL Server, Netezza

Responsibilities:

Profiling sensitive information in the database by continuous interaction with the business and system users.

Understand the requirement of the users to Mask the data and identify the pattern of the data to apply the masking logic

Parameterizing all the values used within TDM to increase the scalability of the code across various non-prod environments.

Documenting each Rule and Plan used by every column of a Table for the entire database.

Customization of Mappings .xml file for the features that are yet to be released in the future version of TDM. For ex: Parameterize connections, parameterize seed value etc.

Co-ordination with BSA’s, Business users, On-shore ETL Developers, Offshore ETL Developers, Testing Team, Informatica Admin Team, DBAs and Informatica support team.

Involvement in Unit Testing of the mappings plus integration testing. Using the best document format and best practice processes for its successful execution.

Extensively involved in writing SQL queries (sub queries and join conditions) for building and testing ETL processes.

Created Function specification, High-level design document (ETL design architecture) and Low level design document and Unit Test Case, deployment and operational document.

Environment: Informatica PowerCenter (v9.6/9/7), Informatica ILM/TDM (v9.6/9/7), Oracle 11g, SQL Server, Netezza, SQL

Project # 5

Marketing Staging Area and Client Reporting

Client

OppenheimerFunds, NYC

Employer

AIT Global Inc

Duration

November 2012 to June 2015

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

Responsibilities:

Performed role of Tech Lead / Sr. ETL Developer, that includes requirement analysis, design, development, testing, and documentation and production deployment.

Responsible for participating and providing analytical solutions for the strategic vision of the MSA and CR in the Architecture Meetings.

Participate in design, code reviews, documentation of design, and implementation of methodologies for all databases.

Proof of Concept (POC) for any new applications.

Architect, design and execute strategic projects for IDW and its Marts.

Involved working on Oracle 11g database, designing and creating tables, relations, constraints, indexes.

Development of Informatica mapping using complex transformation including Lookup, Normalizer, Aggregator, Router, Rank, Web Services, HTTP etc.

Experience with different types of sources and targets – Flat files, Relational Tables, VSAM Files, XML files etc.

Fine tune SQL queries and database performance for all applications within the development environment.

Effectively communicate to Business Partners and team members to resolve the problem and the expected time of resolution.

Responsible for resolving critical issues and permanent fix to Production issues.

Responsible for providing timely (weekly) feedback and necessary help/cooperation to refine processes in order to meet business’s expectations

Frequent communication with the client, business, and QA team on technical and functional queries related to critical production issues.

Environment: Informatica PowerCenter (v9.1), Oracle 11g, SQL Server, SQL, UNIX, AutoSys scheduling, Windows 7

Project # 6

MSA and CR Production Support

Client

OppenheimerFunds, NYC

Employer

AIT Global Inc

Duration

November 2012 to June 2015

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

Responsibilities:

Performed role of a Tech Lead/Sr. ETL Developer, which includes requirement analysis, design, development, testing, and documentation and production deployment.

Responsible for participating and providing analytical solutions for the strategic vision of the MSA and CR in the Architecture Meetings.

Provide Analytical Solutions to Production Support problems with my technical and analytical knowledge base.

Participate in design, code reviews, documentation of design, and implementation of methodologies for all databases.

Proof of Concept (POC) for any new applications.

Architect, design and execute strategic projects for IDW and its Marts.

Involved working on Oracle 11g database, designing and creating tables, relations, constraints, indexes.

Fine tune SQL queries and database performance for all applications within the development environment.

Effectively communicate to Business Partners and team members to resolve the problem and the expected time of resolution.

Responsible for resolving critical issues and permanent fix to Production issues.

Responsible for providing timely (weekly) feedback and necessary help/cooperation to refine processes in order to meet business’s expectations

Frequent communication with the client, business, and QA team on technical and functional queries related to critical production issues.

Environment: Informatica PowerCenter (v9.1), Oracle 11g, SQL Server, SQL, UNIX, AutoSys scheduling, Windows 7

Project # 7

IBDW CCV, CMR and SECPRINT Enhancements

Client

MetLife Insurance, NJ

Employer

IGATE (Formerly known as Patni Computers)

Duration

May 2011 to November 2012

Designation

Tech Lead / ETL Specialist / Sr. ETL Developer

The CCV project was in the Holdings part of the Individual Business Data Warehouse (IBDW).

Responsibilities:

Performed role of Informatica Specialist, which includes requirement analysis, design, development, testing, documentation, production deployment and Administrator tasks.

Designed ETL Architecture using Informatica for implementing various phases of EDW build.

Worked on Informatica PowerCenter to develop mappings, sessions and workflows.

Used Informatica Power Exchange tool to read source data and generate feeds to external systems.

Integrated 34 admin systems into IBDW, built stage and stage history layer with error checking strategies.

Built new ETL process and enhancing some of current processes.

Worked on ACORD data model, which is global standard for Insurance domain, built Marts to enable data exchange with external systems.

Involved working on IBM DB2 database, designing and creating tables, relations, constraints, indexes.

Also developed UNIX Shell Scripts to automate the ETL processes.

Used XML sources to load data into relational tables.

Extensively involved in writing SQL queries (sub queries and join conditions) for building and testing ETL processes.

Have built performance-tuned systems with maximum optimization, reusability and ease of operation.

Created Function specification, High-level design document (ETL design architecture) and Low level design document and Unit Test Case, deployment and operational document.

Role also involved leading the onshore and offshore team members, assigning and reviewing their work.

Environment: Informatica PowerCenter (v8.6 / v9.1), Power Exchange, IBM-DB2, SQL, UNIX, Maestro scheduling, Windows XP

Project # 8

MetLife Institutional Data Warehouse L3 Support

Client

MetLife Insurance, NJ

Employer

IGATE (Formerly known as Patni Computers)

Duration

July 2010 to April 2011

Designation

ETL Specialist / Sr. ETL Developer

The project involves production support activities for the data load process for IDW and current Marts and any other data marts that might be sourced from IDW in future. The scope of support covers the code developed for data load processes using Informatica Power Mart, Unix Scripts, UDB SQL scripts and Maestro Schedules.

Responsibilities:

Performing role of a Tech Lead/Sr. ETL Developer, which includes requirement analysis, design, development, testing, and documentation and production deployment.

Responsible for participating and providing analytical solutions for the strategic vision of the IDW in the Architecture Meetings.

Responsible for creating SQL Utility tool which removes DBA intervention in the execution of SQLs in production environment in case of abends/new releases and allows production management team to do so through Maestro job by simply uploading SQL in a file on production server.

Provide Analytical Solutions to Production Support problems with my technical and analytical knowledge base.

Participate in design, code reviews, documentation of design, and implementation of methodologies for all databases.

Proof of Concept (POC) for any new application.

Architect, design and execute strategic projects for IDW and its Marts.

Work with Relational Modelling (Logical and Physical) IDW and Dimensional Modelling for all the Marts.

Involved working on IBM DB2 database, designing and creating tables, relations, constraints, indexes.

Fine tune SQL queries and database performance for all applications within the development environment.

Effectively communicate to Business Partners and team members to resolve the problem and the expected time of resolution.

Responsible for planning the Disaster Recovery Scenarios.

Responsible for resolving critical issues and suggesting the fix/permanent fix to Production Management team (L1/L2).

Responsible for provide timely (weekly) feedback and necessary help/cooperation to refine processes in order to meet business’s expectations

Frequent communication with the client, business, and QA team on technical and functional queries related to critical production issues.

Environment: Informatica PowerCenter (v8.6), IBM-DB2, Oracle 10g, SQL, UNIX, Maestro scheduling, Windows XP

Project # 9

MetLife Capitol Market Investment Product (CMIP)

Client

MetLife Insurance, NJ

Employer

IGATE (Formerly known as Patni Computers)

Duration

July 2010 to April 2011

Designation

ETL Specialist / Sr. ETL Developer

This project involves the migration of Capital Market Investment Products from FMS2 onto the MUREX application currently being utilized by the Investments department.

Responsibilities:

Performing role of a Tech Lead/Sr. ETL Developer, which includes requirement analysis, design, development, testing, and documentation and production deployment.

Working on Informatica PowerCenter to develop mappings, sessions and workflows.

Responsible for the planning and management of all technical and operational activities.

Requirement analysis, Logical and Physical database design. Architect, design and execute strategic projects for IDW and its Marts.

As a designer, he was responsible for designing and coding of various key modules, which includes creation of Analysis document, Function Specification, Mapping document, HLDs, LLDs and UTCs, KT document for QA Team and for production rollout.

Responsible for updating the latest versions of the entire project related documents in Calligo.

Worked on PowerCenter client tools like Source Analyzer, Warehouse Designer, Mapping Designer, Mapplet Designer and Transformations Developer.

Developed mappings for extracting data from various sources involving flat files.

Mapping business requirements to technical specifications and design logic for implementing the same. Developed data mappings using various transformations between source systems and warehouse using Informatica Designer.

Created sessions using Informatica PowerCenter workflow Manager.

Involved testing and UAT support activities.

Responsible for planning the disaster recovery scenarios.

Running integration test cycles and integration testing.

Involved working on IBM DB2 database, designing and creating tables, relations, constraints, indexes.

Extensively involved in writing SQL queries (sub queries and join conditions) for building and testing ETL processes.

Frequent communication with the client, business, QA team on technical and functional queries.

Generic Error Strategy implementation.

Automated the process through UNIX Shell scripting.

Environment: Informatica PowerCenter (v8.6), IBM-DB2, Oracle 10g, SQL, UNIX, Maestro scheduling, Windows XP

Project # 10

IBDW CCV, CMR and SECPRINT Development

Client

MetLife Insurance, NJ

Employer

IGATE (Formerly known as Patni Computers)

Duration

January 2009 to July 2010

Designation

ETL Specialist / Sr. ETL Developer

Responsibilities:

Performing role of a Sr. ETL Developer, which includes requirement analysis, design, development, testing, and documentation and production deployment.

As a designer, he was responsible for designing and coding of various key modules, which includes creation of Analysis document, Function Specification, Mapping document, HLDs, LLDs and UTCs, KT document for QA Team and for production rollout.

Responsible for updating the latest versions of the entire project related documents in Calligo.

Developed mappings for extracting data from various sources involving flat files.

Mapping business requirements to technical specifications and design logic for implementing the same.

Developed data mappings using various transformations between source systems and warehouse using Informatica Designer.

Developed different ETL strategies like Set Processing, Error Strategy, Polling and Reconciliation, Error Processing and Re-processing.

Created sessions using Informatica PowerCenter workflow Manager.

Building Generic code for quick implementation.

Performance Tuning of Sessions and Mappings to meet the SLA of almost 25 systems required for this critical project.

Environment: Informatica PowerCenter (v8.6), Power Exchange, IBM-DB2, SQL, UNIX, Maestro scheduling, Windows XP

Project # 11

MetLife IB Platform Services Team – Production Support

Client

MetLife Insurance, NJ

Employer

IGATE (Formerly known as Patni Computers)

Duration

Sep 2008 to Dec 2008

Designation

ETL Specialist / ETL Developer

The purpose of this project is to build a data warehouse for Individual Business that will have information that spans several subject areas, including Compliance, Sales, Policy, Product and Party/Organization, MetLife Bank, EDW (Enterprise Data Warehouse), LDW (Legacy Data Warehouse).

Responsibilities:

Monitoring and reporting through Maestro.

Worked on troubleshooting the mappings to improving Performance by identifying bottlenecks.

Have a constant follow-up whether the ABEND or SLA for each task is been logged on to the defect tracking system.

Participate in design, code reviews, documentation of design, and implementation of methodologies for all databases.

Environment: Informatica PowerCenter (v8.6), Power Exchange, IBM-DB2, Oracle 9i, SQL, UNIX, Maestro scheduling, and Windows XP

Project # 12

Project M-Warranty

Client

Volkswagen of America

Employer

IGATE (Formerly known as Patni Computers)

Duration

February 2008 to August 2008

Designation

ETL Developer

Responsibilities:

Used Informatica PowerCenter for (ETL) extraction, transformation and loading data from heterogeneous source systems into target database.

Created Mappings using Mapping Designer to load the data from various sources into multiple targets, using different transformations like Aggregator, Expression, Filter, Lookup, Sequence Generator, Router and Update Strategy.

Update all the project related documents in VSS, the project-tracking tool.

Testing & debugging

Environment: Informatica PowerCenter (v7.1), Oracle 10g, SQL, UNIX, Windows XP

REFERENCES: Available upon request



Contact this candidate