Anudeep Dasari
ETL Developer/Data stage
*******************@*****.***
Ph. +1-631-***-****
PROFESSIONAL SUMMARY:
• 9.5 years of experience in complete Software Development life-cycle (SDLC) of various projects, including requirements gathering, system designing, data modeling, ETL development, production enhancement, support and maintenance.
• Familiar with the complete life cycle of testing methodologies. Involved in test plan creation and have extensive experience with unit, integration and regression testing.
• Experienced in creating and providing data for the testing team for both positive and negative testing.
• Experience in writing Test Cases for Source to Target Testing, Production Validation Testing, Data Transformation Testing and Data Completeness Testing.
• Hands-on experience in creating Indexed Views, complex Stored Procedures, and appropriate Triggers for the purpose of efficient data manipulation and data consistency.
• Extensive experience in using highly scalable parallel processing infrastructure using parallel jobs with multi-node configuration files.
• Experience in working with RDBMS like ORACLE and SQL Server.
• Strong experience in working with Business Analysts to understand business requirements and transform requirements into effective technology solutions.
• Strong skills and extensive hands-on experience using SQL, PL/SQL, various Database utilities including SQL Loader.
• Experience in developing Core Java code using oops Concepts.
• Efficient in all phases of the development life cycle, coherent with data cleansing, data conversion, performance tuning and system testing.
• Designed and developed parameters-driven multi-instance DataStage programs and UNIX functions to perform source files validation
• Excellent knowledge in extraction, cleansing and modification of data from/to various data sources, flat files and comma delimited files (.csv).
• Performed benchmarking analysis and performance tuning of ETL processes; implemented code updates to achieve disk space and processing time savings.
• Knowledge across all aspects of Workflow-documenting business processes, analyzing processes, recommending improvements.
• Strong understanding of analytical and reporting workflows. Experience in developing reports.
• Strong analytical and technical exposure and also working effectively as a team member. TECHNICAL SKILLS
ETL Tools : Data Stage, Informatica, Talend,SSIS
Dimensional Data Modeling : Dimensional Data Modeling, Data Modeling, Star Join Schema Modeling, Snow-Flake Modeling, FACT and Dimensions Tables, Physical and Logical Data Modeling, Oracle Designer Databases : Oracle, SQL Server, DB2, Teradata
Analytical and Reporting Tools : Congo’s, Tableau
Environment : Windows, UNIX, Sun Solaris, AIX
Scheduling Tools : Autosys,Tivoli,Control-M
PROFESSIONAL EXPERIENCE
Newyork City Department of finance, New York City,New York ETL/Data stage Developer, Feb 2017-Till Date
NYC DOF administer the tax and revenue laws of the City fairly, efficiently and transparently to instill public confidence and encourage compliance while providing exceptional customer service. Responsibilites:
• Architect and design the whole ETL process from end to end.
• Involved in the creation of dimensional schema design including dimension tables, fact tables and aggregate tables.
• Involved in the high-level and detailed design of the project.
• Created ETL IBM-Data stage parallel jobs to extract and reformat the source data so it can be loaded into the data warehouse schema.
• Designed Data Stage jobs to extract, transform and load data from various sources and archive systems like Oracle, Text files, XML, CSV, IBM DB2, MS SQL Server into Data Warehouses and Data Marts
• Used SQL for data querying and database administration with Oracle database.
• Responsible for monitoring Data stage job development (included code reviews, concept discussions, error handling issues, etc. ), analyzing data stage source conversion/bottleneck issues, and creating overall data flow process. Made several improvements to old standards and helped resolve many performance issues within both Data stage and the Databases used to extract and load data.
• Jobs used all major components within data stage including transformer stages, lookup stages, join stages, modify stages, sequential files, datasets, CDC stages (change data capture),Unstructured Stage etc.
• Performed Unit testing, Integration testing and User Acceptance testing for every code change and enhancements in development, QA, Preproduction and Production.
• Develop Tivoli Jobs, Job Streams, Calendar’s, and using Tivoli Workload Scheduler Environment: IBM Web Sphere Data Stage 11.5, Oracle, SQL,DB2 PL/SQL,Data Warehousing, AIX, Tivoli,Cognos,HPQC,SVN
Novo Nordisk, Plainsboro, NJ Oct 2016-Jan 2017
ETL Developer,
Novo Nordisk discovers and develops innovative medicines and makes them accessible to patients throughout the world. Nearly a century of innovation and leadership has led to the development of a comprehensive portfolio of products and delivery systems for diabetes care, as well as in other therapeutic areas where the company can make a difference.
Responsibilities:
• Interacted with End user community to understand the business requirements and in identifying data sources.
• Analyzed the existing informational sources and methods to identify problem areas and make recommendations for improvement. This required a detailed understanding of the data sources and researching possible solutions.
• Implemented dimensional model (logical and physical) in the existing architecture.
• Migrated the existing report from one project to another project with added functionality.
• Migrated Existing Data stage jobs from Data stage 8.7 to 11.5 and tested all the functionalities to match with the existing functionalities.
• Understanding the PL/SQL code developed to relate the source and target mappings.
• Involved in preparing the mapping document for source to target.
• Designed and developed ETL processes using Data Stage designer to load data from Oracle, MS SQL, Flat Files (Fixed Width) and XML files to staging database and from staging to the target Data Warehouse database.
• Used Data Stage stages namely Hash file, Sequential file, Transformer, Aggregate, Sort, Datasets, Join, Lookup, Change Capture,SCD,Funnel, Peek, Row Generator stages in accomplishing the ETL Coding.
• Developed job sequencer with proper job dependencies, job control stages, triggers.
• Used Quality Stage to ensure consistency, removing data anomalies and spelling errors of the source information before being delivered for further processing.
• Excessively used DS Director for monitoring Job logs to resolve issues.
• Involved in performance tuning and optimization of Data Stage mappings using features like Pipeline and Partition Parallelism and data/index cache to manage very large volume of data.
• Documented ETL test plans, test cases, test scripts using TIMS excel export Tool, and validations based on design specifications for unit testing, system testing, functional testing, prepared test data for testing, error handling and analysis.
• Exported the test scripts from TIMS excel export tool to HPQC and tested the scripts in Test Plan by capturing the Screen shots of all test scenarios executed.
• Used Autosys job scheduler for automating the monthly regular run of DW cycle in both production and UAT environments.
• Verified the Cognos Report by extracting data from the Staging Database using PL/SQL queries.
• Participated in weekly status meetings.
Environment: IBM Web Sphere Data Stage 8.7, 11.5, Oracle, SQL, PL/SQL, Linux, Autosys, Cognos, OBIEE,HPQC
General Motors, Chandler, Arizona Oct 2015 – Sep 2016 ETL Developer
Establish a global common supplier collaboration tool and processes to measure and hold suppliers Accountable for On-Time Shipping (OTS) and other requirements that drive efficiencies, improve part availability for CCA (customer care aftersales) customers, and in support of the MID-TERM Plan (MTP). Responsibilities:
• To Participate in all phases including Requirement Analysis, Client Interaction, Design, Coding, Testing, Support and Documentation.
• Gather and document requirements, requirements analysis, converting requirements into High Level Design Documents.
• Create dimensional model for the reporting system by identifying required dimensions and facts.
• Use Data stage Designer to develop processes for extracting, transforming, integrating and loading Data from various sources into the Data Warehouse database.
• Use Data stage stages like sequential file, Aggregator, Transformer,Dataset,Look up, joiner, Remove Duplicates, sorter, Change Capture and Funnel.
• Designed Jobs to Work on Physical Change Data Capture Data from Source ESPS to target Supplier Collaboration.
• Worked on invoking java code using java client stage and java Transformer stage.
• Create Sequences for all the Data stage jobs which included Job Activity, User Variable Activity, Notification Activity, Terminator Activity and Notification Activity Sequences.
• Schedule the Sequences in Data stage in development and SIT environments. Use Loop Variables, System Variables and Stage Variables in Data stage to design Future Cast reports For the Business Users.
• Lead team of three Developers, updating and discussing the status of the project with managers.
• Working with the testing team to validate and implement Application use cases.
• Create Tasks and Issue in TFS and following the Agile Methodologies.
• Develop staging and Data Mart DS jobs using Data Stage Designer on parallel environment; involve In writing the possible Unit test cases, testing number of jobs, and also resolving the defects in the developed jobs.
• Worked on Outbound file to send it through SFTP to ESPS (Europe) using a Unix Shell Script.
• Developed Autosys Jobs to schedule the execution of Data Stage jobs. Dependencies are set in Autosys to facilitate parallel execution of Data Stage jobs. Environment: IBM Web Sphere Data Stage 11.3, Informatica, JasperETL, Oracle 12c/11g/10g, SQL, PostgreSQL, PL/SQL, Linux,HPQC Autosys
AIG, Jersey City, NJ Aug 2013-Sep2015
ETL Developer
American international group, Inc. (AIG) is a leading international insurance organization serving commercial, institutional and individual customers. Scope of the project was to develop a single point reference data management system by collecting data from myriad AIG systems and Tax Vendors to facilitate AIG wide transactional and tax business. Tax related data collected from Global Finance data repository combined with reference data management system is provided to TaxStream for tax provision and CorpTax for Tax Filing. Responsibilities:
• Designing mappings between source and target.
• Working with the testing team to validate and implement Application use cases.
• Playing a dual role of developer and a tester while creating mock data and working closely with testing team to validate the data,
• Provided information to the testing team by writing the SQL scripts that provide data for validation.
• Set up Data Stage Design Standards and formulated Unit and Integration Test plans.
• Developed DataStage Parallel jobs on IBM Web Sphere Data Stage (IBM 8.1.0) using Parallel Transformer stage, Basic Transformer stage, Aggregator stage, OCI stage, Datasets, Sort stage, Sequential file, Change Capture, merge, remove duplicates, funnel, Join and Lookup Stages.
• Created DataStage Sequencers using Job Activity, User Variables Activity, Execute Command Activity, Notification Activity, Sequencer activity and Nested Condition activity which house Data Stage jobs.
• Extensively used built-in transforms and created user defined subroutines to implement some of the complex logic to meet the business requirements.
• Performance tuned the Data Stage jobs after analyzing the Score Dump Environment Variable.
• Utilized DataStage Import and Export Utilities to migrate the code from one environment to other.
• Developed procedures, functions & triggers considering optimal performance in terms of storing and retrieving data.
• Utilized HP Quality Center for defect logging.
• Developed Shell scripts to facilitate the execution of DataStage Jobs, to validate the incoming flat files, to archive and compress the files.
• Developed Autosys Jobs to schedule the execution of DataStage jobs. Priority codes are set in Autosys to facilitate parallel execution of DataStage jobs.
• Handled production support systems and resolved production issues meeting SLA’s. Environment: IBM Web Sphere Data Stage 8.5, Oracle 11g/10g, SQL, PL/SQL, Linux, Autosys, HPQC, PVCS Chubb Group of Insurance Companies, Warren, NJ May 2011-June 2013 ETL/Datastage Developer
Chubb Group Of Insurance Companies is a General Insurance company, which helps companies manage a wide range of risks and exposures encountered in today’s business environment. The CSD (Claims System Department) in Property and Casualty at Chubb Group of Insurance Companies wanted to build a warehouse to consolidate their data existing in multiple databases across the country. As part of this project, we developed a warehouse for Claims System Department by using Data Stage. Responsibilities:
• Understand the Technical specifications and Mapping Data Items from Source System to the Target System.
• Responsible for testing the data in the database by writing the appropriate scripts and providing data for initial validation.
• Created mock data in the database for both negative and positive testing based on the test cases.
• Used the Data Stage Designer to develop jobs for extracting, cleansing, transforming, integrating, and loading data into data warehouse database.
• Created various local containers and shared containers.
• Developed jobs using various stages-Transformer Stage, column generator Stage, Filter Stage and Sequential File Stage.
• Designed and developed jobs using Parallel Extender for splitting bulk data in to subsets and to dynamically distribute to all available nodes to achieve best job performance.
• Used Data Stage Designer to extract, cleanse, transform, integrate, and load data into the DataMart
• Developed Data Stage Parallel jobs and Shared Containers using Lookup, JOIN, Copy, Change Capture, Aggregator, Column Export, Column Import, Remove Duplicates, Dataset, Transformer, Lookup File Set, Filter, and Funnel Stages.
• Extensively used the various Partitioning Methods like Hash, Entire, Auto, Same, Round Robin to increase the job performance.
Environment: IBM Info Sphere Data Stage 8.1, Oracle 10g,Teradata, DB2 UDB, UNIX, MSOffice Suite, Erwin, MS Visio, PL/SQL, SQL Server, Toad.
Global Trust Bank, Hyderabad, India. October 2009-April 2011 Oracle Programmer
The transaction data was extracted from various branches across the country, which uses a system called GDAS and developed reports for the project, which is the most comprehensive, intranet powered information management system ever available to the ATM Transactions financial Services. The system is meant to efficiently manage and integrate all of their financial, and administrative data Services. FTP component was developed internally. Providing secure, Intranet based ATM Transaction management system. Responsibilities:
• Involved in the creation of database objects like tables, views, packages, procedures
• Involved in data gathering and system functionality requirements study
• Developing Oracle PL/SQL stored procedures, Functions, SQL scripts to facilitate the functionality for various modules
• Developed PL/SQL Procedures, Functions and used SQL loader to load data into the database
• Extensively worked in Oracle SQL, PL/SQL, Query performance tuning, created DDL scripts; Created database objects like Tables, Views Indexes, Synonyms and Sequences
• Developed Packages, customized functions and Triggers based upon the business logics
• Developed Shell scripts and SQL control files to load data through SQL Loader
• Conducted analysis using SAS
• Developed reports using Business Objects Web Intelligence Environment: Oracle, SQL, PL/SQL, SAS, SAP Business Objects, Windows XP and UNIX.