BALAKRISHNA
Email: ****.*******@*****.***
PROFESSIONAL SUMMARY:
Over 9 years of IT experience in all phases of Software Development Life Cycle (SDLC) which includes User Interaction, Business Analysis/Modeling, Design, Development, Integration, Planning and testing and documentation in data warehouse applications, ETL processing and distributed applications
Strong expertise in using ETL Tools Informatica Power Center 9.6 (Repository Manager, Designer, Workflow Manager, Workflow Monitor), IDQ, PowerExchange and ETL concepts.
Experienced in writing SQL, PL/SQL programming, Stored Procedures, Functions, Cursors, Triggers, Views, Materialized Views, indexes, partitions, table partitions and query performance tuning.
Worked with various transformations like Normalizer, expression, rank, filter, group, aggregator, lookup, joiner, sequence generator, sorter, SQL, stored procedure, Update strategy, Source Qualifier.
Data Modeling : Data modeling knowledge in Dimensional Data modeling, Star Schema, Snow-Flake Schema, FACT and Dimensions tables.
Hands on experience in tuning mappings, identifying and resolving performance bottlenecks in various levels like sources, targets, mappings, and sessions.
Extensive experience with Data Extraction, Transformation, and Loading (ETL) from disparate data sources like Multiple Relational Databases (Teradata, Oracle, SQL SERVER, DB2), VSAM, XML and Flat Files.
Experience in working with POWER EXCHANGE to process the VSAM files.
Designing and developing Informatica mappings including Type-I, Type-II, Type-III slowly changing dimensions (SCD).
Coordinating with Business Users, BI teams, functional Design team and testing team during the different phases of project development and resolving the issues.
Worked with Informatica Data Quality (IDQ) tool kit, Analysis, data cleansing, data matching, data conversion, exception.
Performed the data profiling and analysis making use of Informatica Data Quality (IDQ).
Identified and eliminated duplicated in datasets through IDQ components of Biagram Distance, Edit Distance, Hamming Distance.
Worked with various IDQ transformations like Standardizer, Match, Association,Parser, Weighted Average, Comparison, Consolidation,Decision, Expression.
Basic knowledge and understanding of Informatica Cloud.
Experienced in Teradata SQL Programming and Worked with Teradata utilities like Fast Load and Multi Load and Tpump and Teradata Parallel transporter.
Experienced in using advanced concepts of Informatica like PUSH DOWN OPTIMIZATION (PDO),PIPELINE PARTITIONING.
Good hands on experience in writing UNIX shell scripts to process Data Warehouse jobs.
Experience in working with big data Hadoop stack tools like HDFS, HIVE, Pig, Sqoop.
Expert in importing and exporting data into HDFS and Hive using Sqoop.
Experience in performance tuning the HiveQL and Pig scripts.
Applied various techniques at both database level and application level to find the bottle necks and to improve performance.
Good skills in defining standards, methodologies and performing technical design reviews.
Executed software projects for Banking and financial services.
Good communication skills, interpersonal skills, self-motivated, quick learner, team player.
TECHNICAL SKILLS
ETL Tools
Informatica Power Center 9.6, Informatica Data Quality IDQ, Informatica Cloud, Data Stage 8.7, Informatica PowerExchange, Pentaho
Languages
SQL,PLSQL,UNIX Shell Scripting
Methodology
Agile RUP, SCRUM, Waterfall
Databases
Teradata14/13/v2r12/v2r6/v2r5,Oracle11i/10g/9i,DB2,SQLSERVER2005/2008,Netezza
Operating Systems
Windows, UNIX, Linux
IDEs
Eclipse,PL/SQL Developer, TOAD, Teradata SQL Assistant, SQL * Loader, Erwin 3.5
BI Reporting Tools
Crystal Reports, Business Objects, OBIEE
Scheduling Tools
Control-m, Autosys, Tidal
Big Data Technologies
Hadoop, HDFS, Map Reduce, Hive, Pig, HBase, Sqoop, Oozie
Tracking Tool
JIRA,VersionOne .
PROFESSIONAL EXPERIENCE
JOHNSON & JOHNSON HEALTH AND WELLNESS, Detroit, MI Dec’15-Till Now
Role: Senior ETL Lead/Informatica Developer
Description: Johnson & Johnson health and wellness is an organization providing solutions for health assessment, digital health coaching and energy management. Project aims to generating consumer analytics and reporting applications around health behaviors, disease statistics, productivity, and risk prevalence by building data warehouse called FUSION. It also generates Data Extracts contain data for incentive eligibility, claims analysis, clinical studies, and follow-up Digital Health Coaching programming provided directly to third-party administrators.
Responsibilities
Working closely with the business users to understand the requirements and converting them into project level technical capabilities.
Identified and debugged the critical production issues and worked on its resolution.
Supported the Production, UAT, QA, SIT deployments.
Coordinating with offshore team and providing the inputs and technical guidance.
Coordinating the onsite and off-shore teams on a daily basis.
Involved in designing the tables structure and relations based on the requirements and created the new tables based on the design.
Design and Development of ETL mappings using Informatica.
Worked on the Mapping specification documents and provided complex sql queries to QA team which helps in testing data.
Trained the offshore team on Informatica, Pl/SQL, Oracle coding, Tidal, Shell scripting standards and debugging the code.
Provide technical support to ETL applications on Informatica, UNIX, Oracle and Teradata.
Prepared Coding Standards, ETL Build Peer Review Checklist’s and Unit Test Case Templates for different work packages.
Developed the Code following the code quality standards and involved in Team code reviews to make sure the code was developed with all the code quality standards.
Work with various transformations like Normalizer, expression, rank, filter, group, aggregator,
lookup, joiner, sequence generator, sorter, sql, stored procedure, Update strategy, Source Qualifier
Design the Mapplet and reusable transformation according to the business needs.
Worked on the complex sql queries, pl/sql packages, Triggers.
Performance tuning by determining bottlenecks
Design and develop informatica mappings including Type-I, Type-II, Type-III slowly changing dimensions.
Created the Tidal jobs and scheduled them to run daily, weekly, and monthly and to run in hourly intervals to automate the loads
Supported and worked with QA to test the stories in QA environment.
Environment: Informatica 9.6, Teradata 14, Oracle 11i, Teradata SQL Assistant, TOAD, Tidal, Unix, Citrix, JIRA, FIT
WELLSFARGO, San Francisco, CA Mar’15 – Nov’15
Role: Senior ETL Lead/ Informatica Developer
Description: The purpose of this Project is to assess high risk customers based on Customers business activity, ownership structure, anticipated or actual volume and types of transactions. Identifying the customers with different risks and relationships among them .So that they can monitor the suspicious activities by customers and improve efficiencies of KYC (Know your customer) and CAP (Customer Acceptance Program).
Responsibilities
Estimate volume of work and deriving delivery plans to fit into overall planning
Design and Development of ETL mappings using Informatica
Provide technical support to ETL applications on Informatica,UNIX,Oracle and Teradata
Work with various transformations like Normalizer, expression, rank, filter, group, aggregator,
lookup, joiner, sequence generator, sorter, sql, stored procedure, Update strategy, Source Qualifier
Performed the data profiling and analysis making use of Informatica Data Quality (IDQ).
Worked with Informatica Data Quality (IDQ) tool kit, Analysis, data cleansing, data matching, data conversion, exception.
Used reference tables and rules created in Analyst tool.
Used various IDQ transformations like Standardizer, Match, Association,Parser, Weighted Average, Comparison, Consolidation,Decision, Expression
Implement Data Quality Rules using IDQ to check correctness of the source files and perform the data cleansing/enrichment.
Design the Mapplet and reusable transformation according to the business needs.
Performance tuning by determining bottlenecks
Design and develop Informatica mappings including Type-I, Type-II, Type-III slowly changing dimensions.
Environment: Informatica 9.6 (Power Center & Data Quality IDQ), Teradata 14, Oracle 11i, Teradata SQL Assistant, SQL Developer, Unix, Citrix
Liberty Mutual, Boston, MA May 2014 – Feb 2015
Role: Senior Informatica ETL Developer/Hadoop Engineer
Description: This project aims to implement a central customer profile based on a CDI hub that links operational systems to customer data and operates in a closed-loop manner with the analytical systems, enables a complete view of the customer. A 360 degree view of the customer will provide the competitive edge necessary for sustained growth and profitability. The business is not able to correlate customer attributes with revenue, cost or customer satisfaction without excessive effort that is not readily repeatable. This program is intended to provide new customer insight capabilities for Distribution and Consumer Marketing.
Responsibilities
Design & Development of ETL mappings using Informatica 9.1
Provide technical support to ETL applications on Informatica 9.1, UNIX and Oracle
Preparation and Review of Project Macro & Micro design based on the LM solution outline document.
Validating data files against their control files and performing technical data quality checks to certify source file usage
Profiled the data using Informatica Data Quality (IDQ) and performed Proof of Concept.
Worked with Informatica Data Quality (IDQ) tool kit, Analysis, data cleansing, data matching, data conversion, exception.
Used reference tables and rules created in Analyst tool.
Used various IDQ transformations like Standardizer, Match, Association,Parser, Weighted Average, Comparison, Consolidation,Decision, Expression
Involved in designing the Mapplet and reusable transformation according to the business needs.
Designing and developing Informatica mappings including Type-I,Type-II, Type-III slowly changing dimensions(SCD)
Effectively used various tasks (Reusable & Non Reusable), Command, Assignment, Decision, Event Raise, Event wait, Email…..etc.
Identify performance bottlenecks, tuning queries, suggesting and implementing alternative approaches like range partitioning of tables.
Coding & testing the Informatica Objects & Reusable Objects as per Liberty Mutual BI standards.
Attend Technical meetings & discussions.
Prepare High Level and Low Level Design Documents
Worked with Teradata sources/targets.
Used Sqoop to export data from HDFS to Teradata database.
Created Hive managed and external tables.
Performance tuning the hive queries
Created pig scripts to process the files.
Used HDFS system to copying files from local to HDFS file system
Attending onsite/offshore team meetings.
Environment: Informatica 9.1(Power Center & Data Quality IDQ), Teradata 13, Oracle, MS SQL SERVER 2008, OBIEE, Unix, Hadoop 1.1.0 and HDFS, Sqoop, HIVE, Pig
JPMorgan Chase Bank, Columbus, OH Aug 2013 – Apr 2014
Role: Senior Informatica Developer
Description: The Project JP Morgan Chase bank’s CDI (Customer Data Initiatives) team, part of RFS (Retail Financial Services) division recently conducted a Consumer Analytics and Data Strategy (CADS) study to assess current analytical capabilities and supporting data environments in an effort to improve efficiencies and gain greater data accessibility, specifically around consumer, credit and fraud data.
Responsibilities:
Coordinating with Onsite Team and client for Requirements gathering and analysis.
Understanding and developing the JPMC ETL framework for Informatica objects as per coding standards.
Performed the data profiling and analysis making use of Informatica Data Quality (IDQ).
Worked with Informatica Data Quality (IDQ) tool kit, Analysis, data cleansing, data matching, data conversion, exception.
Used reference tables and rules created in Analyst tool.
Used various IDQ transformations like Standardizer, Match, Association,Parser, Weighted Average, Comparison, Consolidation,Decision, Expression
Implement Data Quality Rules using IDQ to check correctness of the source files and perform the data cleansing/enrichment.
Loading data into operational data store.
Designing and developing Informatica mappings including Type-I,Type-II, Type-III slowly changing dimensions(SCD)
Coding & testing the Informatica Objects & Reusable Objects as per JPMC's BI standards.
Participating in peer review of Informatica objects.
Estimating volume of work & Deriving delivery plans to fit into overall planning
Prepared ETL Build Peer Review Checklist’s and Unit Test Case Templates for different work packages.
Involved in Unit Testing, Integration Testing and System Testing.
Environment: Informatica 9.1(PC & IDQ), Data Stage, TeraData 13, SQL SERVER 2008, Teradata SQL Assistant, SQL Developer, UNIX, CITRIX, Business Objects, JIRA
Frost National Bank Oct 2012 – Jul 2013
Role: Senior Informatica Developer
Description: Frost Bank is subsidiary of Cullen/Frost Bankers, Inc. is a financial holding company, operates more than 100 financial centers across southern part of North America, which provides a full range of commercial and consumer banking products, investment and brokerage services, insurance products and investment banking services.
Frost is planning to build a robust consolidated operational data store(ODS ) System in order to build strategic capabilities for customer, account, transactional and profitability information for all lines of business. As part of BI, all the existing production reports to be recreated and new reports to be build based on the current reporting trend.
Responsibilities:
Co-coordinating the onsite and off-shore teams on a daily basis
Developing maps and workflows in Informatica to load data into Teradata.
Loading data into operational data store(ODS).
Used various transformations like Filter, Expression, Sequence Generator, Update Strategy, Joiner, Router and Aggregator to create robust mappings in the Informatica Power Center Designer.
Designing and developing Informatica mappings including Type-I,Type-II, Type-III slowly changing dimensions(SCD)
Unit Testing the maps & workflows.
Coding & testing the Informatica Objects & Reusable Objects as per Frost BI standards.
Extensive performance tuning by determining bottlenecks at various points like targets, sources, mappings and sessions.
Attend Technical meetings & discussions.
Created Complex mappings using Unconnected, Lookup, and Aggregate and Router transformations for populating target table in efficient manner.
Maintaining documentation.
Environment: Informatica 9..1, Teradata v2r12, Oracle 11i, SQL Developer, Teradata SQL Assistant, Crystal Reorts, UNIX, JIRA