Mountain Energy Hadoop Data

Location:
Pittsburgh, PA
Posted:
December 12, 2022

Aniket Pandurang Katkar

+1-412-***-****

adt0pg@r.postjobfree.com

Education

Bachelor’s degree in Information Technology from Shivaji University, Kolhapur.

Professional Summary

Over 10 years of total experience in the IT industry with BI tools including Actuate, SAP Crystal Reports, QlikView, and Tableau, and 2.5 years of experience with the Hadoop ecosystem, including Apache Spark (PySpark SQL), Apache Hive, Impala, and HDFS.

Experience in analysis, design, implementation, and testing of various business applications, including extensive experience in web reporting using Actuate e.Reporting Suite (versions 7.x/8.x/9.x), QlikView and Tableau dashboards, and writing SQL queries.

Build data pipelines using Spark SQL/DataFrames.

Use Hadoop data to build data pipelines with the Apache PySpark APIs, applying different transformations and loading data into multiple layers of partitioned/bucketed Hive tables. Data cleansing, date formatting, and retention of history data are part of the daily process. Automated Spark jobs that run on a daily/monthly/quarterly basis and presented extracts in Tableau for visualization using Impala.

Interacted with business users to create and document report specifications.

Extensive experience in the Software Development Life Cycle (SDLC), including requirements and systems analysis, design, programming, implementation, and application maintenance.

Knowledgeable about designing, deploying, and operating highly available, scalable, and fault-tolerant systems using Amazon Web Services (AWS).

Good knowledge of software engineering concepts, techniques, development methodologies and documentation.

Good experience with project documentation such as weekly status reports and user manuals.

Interacted with Business Analysts and software developers for bug reviews and participated in QA meetings.

Experience managing and coordinating teams of 4-7 members. Also conducted thorough knowledge transition for team members to meet business requirements.

Excellent communication, interpersonal, and analytical skills; a highly motivated team player who can also work independently.

Ability to learn and adapt quickly to emerging technologies and paradigms. Learnt many technologies on the job as project requirements demanded. Trained in the Informatica PowerCenter ETL tool.

Basic awareness of UNIX commands.

Areas/Applications

Reporting and ETL Tools

QlikView, SAP Crystal Reports, Actuate 7.x/8.x/9.x, Tableau, Informatica

Big Data Hadoop Ecosystem

Apache Spark, Apache Hive, Impala, HDFS, PySpark SQL

Database

Oracle 9i, SQL Server, Hive and Impala query editors

Languages

C, C++, VB, SQL, PL/SQL, PHP, Joomla 1.5 Framework, Python Spark API

Operating Systems

Windows 2000/2003 Server, NT, UNIX

Certifications

Tableau Desktop Specialist, AWS Cloud Practitioner

Career Profile

Since 2011-07-13 Tata Consultancy Services

Project Name: AML Processing on Hadoop for PCM4 (Portfolio Command Modules 4)

Period February 2021 – Present

Client Name PNC Financial Services

Position Hadoop Spark Developer/Tableau Report Developer

Project Descriptions:

We now have a Hadoop data lake holding almost 2 years of history data. The data lake has multiple layers: staging, cleansed, integrated, and presentation. Multiple tables can be used for reporting and query building with PySpark, and the results are displayed on Tableau dashboards that give the client deep insight into customer trends on AML grounds.

This project consists of designing and building an AML portfolio monitoring, visualization, and decision support tool for analysing portfolio segments and transaction trends. The objectives of PCM are to centralize and standardize AML data, provide a single source of AML information that allows AML stakeholders to identify, investigate, track, and report on AML risks more efficiently, consolidate and industrialize existing AML reporting capabilities, and create a set of customized, intuitive ‘point and click’ dashboards and reports for key AML stakeholders. The scope of this project is divided into three categories, each described below.

o Drastic reduction/elimination of time spent by AML investigators sourcing and assembling data.

o Creation of a sustainable data capture/data quality remediation process and visualization through Tableau for the portfolio modules below:

    o Retail Business Banking: DDA Accounts

    o Retail Business Banking: Money Market Accounts

    o Retail Business Banking: Lending Accounts

Responsibilities

Created project requirement specification document including detailed design and high-level document.

Built and deployed data pipelines using PySpark. Created aggregated data using PySpark DataFrames, built low-latency Tableau extracts using Impala, and scheduled them to run on a monthly basis.

Participated in process improvements such as addressing the Hadoop small-files issue, tuning and memory optimization of Spark jobs, choosing the right file formats for effective cluster utilization, and creating reusable frameworks. Built optimized spark-submit jobs and grouped them in a conf file to populate data across multiple layers.

Attend daily client meetings, report the overall progress of the pipelines and any blockers, and present a prototype demo at the end of each sprint. As both lead and developer, I build the code and ensure that design, build, test, and go-live of the modules stay within timelines, without extension or slippage, by assisting my team with the challenges they come across during any phase.

Managed code versions and deployments using Git and UDeploy.

Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.

Restricted data access for users through permissions and user filters.

Developed Tableau visualizations and dashboards using Tableau Desktop.

Collaborate with the customer to finalize and clarify specifications and align them with the technical capabilities needed to meet evolving project requirements.

Ensure that all technical deliverables are aligned with the solution architecture & design and meet the functional and non-functional requirements.

Responsible for developing a report that determines the month-over-month change in customer and account metrics, applies ranking, and identifies the biggest movers across months.

Tuned dashboard performance by analysing queries against data volumes.

Responsible for coordinating with the Tableau Admin team and supporting them in performing administrative tasks.

Ensure that appropriate and adequate unit test cases are created and executed. Participate in customer interactions to provide daily insights and updates on the current status of the project.
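
For illustration only, the month-over-month report logic mentioned in the responsibilities above might be sketched as follows. This is plain Python, not the project's PySpark or Tableau code; the function names, field names, and sample values are hypothetical, and the sketch assumes each account has one metric value per month and compares each month with the previous month present in the data.

```python
from collections import defaultdict


def month_over_month(rows):
    """Compute per-account change between consecutive months.

    rows: iterable of (account_id, month as 'YYYY-MM', metric value).
    All names here are hypothetical placeholders.
    """
    by_account = defaultdict(dict)
    for account_id, month, value in rows:
        by_account[account_id][month] = value

    changes = []
    for account_id, series in by_account.items():
        months = sorted(series)  # 'YYYY-MM' strings sort chronologically
        for prev, curr in zip(months, months[1:]):
            changes.append({
                "account_id": account_id,
                "month": curr,
                "change": series[curr] - series[prev],
            })
    return changes


def biggest_movers(changes, month, top_n=3):
    """Rank accounts by absolute change for one month, largest first."""
    in_month = [c for c in changes if c["month"] == month]
    ranked = sorted(in_month, key=lambda c: abs(c["change"]), reverse=True)
    return [dict(c, rank=i) for i, c in enumerate(ranked[:top_n], start=1)]


# Made-up sample data, for demonstration only.
sample = [
    ("A1", "2021-01", 100.0), ("A1", "2021-02", 160.0),
    ("A2", "2021-01", 500.0), ("A2", "2021-02", 480.0),
    ("A3", "2021-01", 50.0), ("A3", "2021-02", 250.0),
]
movers = biggest_movers(month_over_month(sample), "2021-02", top_n=2)
for m in movers:
    print(m["rank"], m["account_id"], m["change"])
```

Ranking on the absolute change treats large increases and large decreases alike as "movers"; ranking on the signed change would be an equally reasonable choice depending on the report's purpose.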

Project Name: Anti Money Laundering-NRA

Period February 2020 – December 2020

Client Name PNC Financial Services

Position PySpark Developer

Project Description:

(A risk-based model was created for NRA customers using feature set extraction. Earlier, this process was manual. With changes in regulatory requirements, the client decided to automate the process for NRA customers using Big Data Hadoop technology.)

Key Responsibilities

Created the requirement specification document. Gathered and analyzed business requirements with the customer in order to formulate design decisions. Translated business needs and requirements into application programs using Hive/PySpark.

Involved in SIT and UAT of the developed code and in preparing project documents such as technical designs and status reports.

Worked with Apache Spark DataFrames and RDDs to create PySpark scripts for the business's reporting and dashboard visualization needs.

Converted scripts developed in Oracle PL/SQL to PySpark on Hadoop. Used and modified reusable PySpark scripts/frameworks to load data from the Hadoop data lake into successive layers and for report generation in Tableau.

Involved in Hive table design for DDL and DML operations: loading the source tables; applying transformations such as filtering, aggregation, adding columns and operating on them, date format conversion using Python functions, and partitioning data; and finally writing the results into target Hive tables in successive layers using reusable PySpark frameworks.

Created low-latency extracts for Tableau reporting for front-end business users, using Impala instead of Hive as the data source.
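
As a rough sketch of the transformation steps listed above (filtering, date format conversion, adding a column, aggregation, and partitioning), the flow might look like the following. Plain Python dictionaries stand in for the PySpark DataFrames and Hive tables used on the project; the field names, the input date format, and the month-based partition key are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime


def transform(records, min_amount=0.0):
    """Filter, convert dates, derive a partition key, and aggregate.

    Returns {partition_key: {customer_id: total_amount}}.
    Field names ('customer_id', 'txn_date', 'amount') are hypothetical.
    """
    partitions = defaultdict(lambda: defaultdict(float))
    for rec in records:
        # Filtering: drop rows at or below the threshold.
        if rec["amount"] <= min_amount:
            continue
        # Date format conversion: parse the assumed 'MM/DD/YYYY' source format.
        txn_date = datetime.strptime(rec["txn_date"], "%m/%d/%Y").date()
        # Added column: a 'YYYY-MM' key used to partition the output.
        month_key = txn_date.strftime("%Y-%m")
        # Aggregation: total amount per customer within each partition.
        partitions[month_key][rec["customer_id"]] += rec["amount"]
    return {key: dict(totals) for key, totals in partitions.items()}


# Made-up sample rows, for demonstration only.
source = [
    {"customer_id": "C1", "txn_date": "01/15/2020", "amount": 120.0},
    {"customer_id": "C1", "txn_date": "01/20/2020", "amount": 30.0},
    {"customer_id": "C2", "txn_date": "02/03/2020", "amount": 75.0},
    {"customer_id": "C2", "txn_date": "02/04/2020", "amount": -10.0},
]
result = transform(source)
for month_key in sorted(result):
    print(month_key, result[month_key])
```

In a Spark job the same steps would typically be expressed as DataFrame operations and the partition key would become a Hive partition column; the dictionary keyed by month plays that role here.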

Project Name: AML Processing on Hadoop for PCM3 (Portfolio Command Modules 3)

Period February 2018 – August 2019

Client Name PNC Financial Services

Position Tableau Report Developer

Project Descriptions:

This project is part of PNC’s AML Bank Secrecy program, which helps build Hadoop platforms that deliver an array of powerful analytics and processing engines that work seamlessly with existing infrastructure, providing core functions of data governance, security, backup, archives, and system management, with flexibility and extensibility.

This project consists of designing and building an AML portfolio monitoring, visualization, and decision support tool for analysing portfolio segments and transaction trends. The objectives of PCM are to centralize and standardize AML data, provide a single source of AML information that allows AML stakeholders to identify, investigate, track, and report on AML risks more efficiently, consolidate and industrialize existing AML reporting capabilities, and create a set of customized, intuitive ‘point and click’ dashboards and reports for key AML stakeholders. The scope of this project is divided into three categories, each described below.

o Drastic reduction/elimination of time spent by AML investigators sourcing and assembling data.

o Creation of a sustainable data capture/data quality remediation process and visualization through Tableau for the portfolio modules below:

    o ATM, Branch & Cash (ABC)

    o Zelle PCM

    o Cash Vault Gen3

Responsibilities

Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.

Restricted data access for particular users through permissions and user filters.

Developed Tableau visualizations and dashboards using Tableau Desktop.

Collaborate with the customer to finalize and clarify specifications and align them with the technical capabilities needed to meet evolving project requirements.

Ensure that all technical deliverables are aligned with the solution architecture & design and meet the functional and non-functional requirements.

Responsible for developing a report that determines the month-over-month change in customer and account metrics, applies ranking, and identifies the biggest movers across months.

Tuned dashboard performance by analyzing queries against data volumes.

Responsible for coordinating with the Tableau Admin team and supporting them in performing administrative tasks.

Ensure that appropriate and adequate unit test cases are created and executed. Participate in customer interactions to provide daily insights and updates on the current status of the project.

Project Name: PCM 2 (Portfolio Command Modules 2)

Period July 2017 – January 2018

Client Name PNC Financial Services

Position Tableau Report Developer

Project Descriptions:

This project consists of designing and building an AML portfolio monitoring, visualization, and decision support tool for analysing portfolio segments and transaction trends. The objectives of PCM are to centralize and standardize AML data, provide a single source of AML information that allows AML stakeholders to identify, investigate, track, and report on AML risks more efficiently, consolidate and industrialize existing AML reporting capabilities, and create a set of customized, intuitive ‘point and click’ dashboards and reports for key AML stakeholders. The scope of this project is divided into three categories, each described below.

o Ability to generate Quarterly Risk Management (QRM) reports in a fully automated end-to-end process; previously this was a manual process that required weeks and multiple teams to prepare the report.

o Drastic reduction/elimination of time spent by AML investigators sourcing and assembling data.

o Creation of a sustainable data capture/data quality remediation process and visualization through Tableau for the portfolio modules below:

    o Foreign Correspondent Bank (FCB)

    o Domestic Correspondent Bank (DCB)

    o Asset Management Group (AMG)

    o Nassau-Bahamas EuroDollar Accounts

Responsibilities

Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.

Restricted data access for particular users through permissions and user filters.

Develop and maintain Tableau visualizations and dashboards using Tableau Desktop.

Tuned dashboard performance by analyzing queries against data volumes.

Responsible for coordinating with the Tableau Admin team and supporting them in performing administrative tasks.

Collaborate with the customer to finalize and clarify specifications and align them with the technical capabilities needed to meet evolving project requirements.

Ensure that all technical deliverables are aligned with the solution architecture & design and meet the functional and non-functional requirements.

Responsible for developing a report that determines the month-over-month change in customer and account metrics, applies ranking, and identifies the biggest movers across months.

Ensure that appropriate and adequate unit test cases are created and executed. Participate in customer interactions to provide daily insights and updates on the current status of the project.

Project Name: PCM 1 (Portfolio Command Modules 1)

Period July 2017 – January 2018

Client Name PNC Financial Services

Position Tableau Report Developer

Project Descriptions:

These PCM dashboards were designed for the Enterprise AML Executive team and provide:

o The ability for decision makers to turn off the traditional transaction monitoring reports without losing visibility of customers’ funds transfer/card activities.

o Drastic reduction/elimination of time spent by AML investigators sourcing and assembling data.

o Creation of a sustainable data capture/data quality remediation process and visualization through Tableau for the modules below:

    o Commercial Credit Card

    o Consumer Credit Card

    o Non-Resident Alien Customers

    o Remote Deposit Capture

Responsibilities

Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.

Restricted data access for particular users through permissions and user filters.

Developed Tableau visualizations and dashboards using Tableau Desktop.

Collaborate with the customer to finalize and clarify specifications and align them with the technical capabilities needed to meet evolving project requirements.

Ensure that all technical deliverables are aligned with the solution architecture & design and meet the functional and non-functional requirements.

Responsible for developing a report that determines the month-over-month change in customer and account metrics, applies ranking, and identifies the biggest movers across months.

Ensure that appropriate and adequate unit test cases are created and executed. Participate in customer interactions to provide daily insights and updates on the current status of the project.

Project Name: AVS (Advanced Visualizations and Simulations)

Period July 2017 – January 2018

Client Name PNC Financial Services

Position Tableau Report Developer

Project Descriptions:

This project consists of designing and building an AML portfolio monitoring, visualization, and decision support tool for analysing portfolio segments and transaction trends. The objectives of AVS are to centralize and standardize AML data, provide a single source of AML information that allows AML stakeholders to identify, investigate, track, and report on AML risks more efficiently, consolidate and industrialize existing AML reporting capabilities, and create a set of customized, intuitive ‘point and click’ dashboards and reports for key AML stakeholders. The scope of this project is divided into three categories, each described below.

o Ability to generate Quarterly Risk Management (QRM) reports in a fully automated end-to-end process; previously this was a manual process that required weeks and multiple teams to prepare the report.

o Drastic reduction/elimination of time spent by AML investigators sourcing and assembling data.

o Creation of a sustainable data capture/data quality remediation process and visualization through Tableau.

Responsibilities

Created action filters, parameters and calculated sets for preparing dashboards and worksheets in Tableau.

Restricted data access for particular users through permissions and user filters.

Developed Tableau visualizations and dashboards using Tableau Desktop.

Collaborate with the customer to finalize and clarify specifications and align them with the technical capabilities needed to meet evolving project requirements.

Ensure that all technical deliverables are aligned with the solution architecture & design and meet the functional and non-functional requirements.

Responsible for developing a report that determines the month-over-month change in customer and account metrics, applies ranking, and identifies the biggest movers across months.

Ensure that appropriate and adequate unit test cases are created and executed. Participate in customer interactions to provide daily insights and updates on the current status of the project.

Project Name: Woodside Energy

Period August 2016 – July 2017

Client Name Woodside Energy

Position Tableau Report Developer

Responsibilities

Scope of the project is to develop a dashboard for different Business Units for incidents raised.

Involved in scripting and design of the project.

Fetched data from different sources such as SQL databases and MS Excel files.

Created and loaded TDEs with section access.

Managed reports by making suggestions and proposing actions.

Created reports based on business user requirements.

Scope of the project is to support existing dashboards by providing feasible solutions for incidents raised.

Operating System Windows OS

Languages VizQL

Special Software Tableau, SQL Server, Service Manager

Pune

Period July 2011 – July 2016

Client Name Humana Inc.

Position Report Developer

Project Description

Humana Inc. was founded in 1961 in Louisville, Kentucky, and has a customer base of over 11.5 million in the United States. The company is the largest (by revenues) Fortune 500 company headquartered in Kentucky and has a market cap of over US $13 billion, $25.2 billion in revenue, and over 26,000 employees nationwide. Humana markets its health benefit consumer services in all 50 states, D.C., and Puerto Rico, and has international business interests in Western Europe.

The scope of the project is to perform enhancements to Humana’s Sales Incentive Compensation plans and new products in the 2010 Design Project, which includes three processing units: Senior Products, Group Medicare Products, and Commercial Products. Senior products are government-sponsored plans for senior citizens. Commercial products include plans for individuals. Humana One is the part of Commercial products where new products are added. Group Medicare products are for groups of individuals or organizations.

Responsibilities

Scope of the project is to develop a simple dashboard for Commercial and Retail Business Units for incidents raised.

Involved in scripting and design of the project.

Fetched data from different sources such as SQL databases and MS Excel files.

Created and loaded QVDs with section access.

Managed reports by making suggestions and proposing actions.

Created reports based on business user requirements.

Scope of the project is to perform enhancements to the Sales Incentive Compensation plans, which include three processing units: Senior Products, Group Medicare Products, and Commercial Products.

Project Callidus EIM

Operating System Windows OS, UNIX

Languages VizQL

Special Software SAP Crystal Reports XI, SAP BO CMC, QlikView, Oracle 9i, Informatica

Hyderabad, Pune

February 2010 – June 2011 SEEInfobiz Pvt.Ltd

Period February 2010 – June 2011

Client Name Vodafone Essar Private Limited

Position Report Developer

Project Description

The DISHA EBP project is a migration project from the BSCS billing system to the AMDOCS billing system. In the BSCS billing system, input files are received in TIMM format; in the AMDOCS billing system, these are replaced with flat files (.txt format).

Responsibilities

Handled the DISHA EBP application for Vodafone Essar across its 8 circles in India.

Deployed the billing reports on the management portal (web) with proper security and accessibility.

Arranged flat files in the specified format to generate Actuate instance reports (bills) using bursting and log files on UNIX servers.

Generated PostScript files for the bills produced.

Populated the database with consolidated bill values. Using this database, generated the Company Health Report and Bill Matrix Report.

Performed data analysis and error handling.

Developed Actuate reports for sending bills via email and fax.

Deployed new logic using Actuate functions to improve report generation performance.

Developed a new design to validate generated logs against the source file.

Migrated the Electronic Bill Presentment process to a clustered environment. Actuate’s clustering feature enables using the processing power of multiple servers at the same time. This type of scalability provides better performance for the future.

Implemented new procedures to make the process more dynamic and smooth.

Project DISHA EBP (Electronic Bill Processing)

Operating System Windows OS, UNIX

Special Software Actuate e Report Design Professional, iServer, UNIX, Oracle 9i

Mumbai, Pune

September 2008 – January 2010 Rainbow Infotech India Pvt.Ltd

Period September 2008 – January 2010

Client Name Green Mountain Energy

Position Developer

Project Description

Green Mountain Energy is a web-based application for the company, which is the USA’s leading retail provider of cleaner energy and carbon offset solutions. Green Mountain offers residential, business, institutional, and governmental customers an easy way to purchase cleaner, affordable electricity products, as well as the opportunity to offset their carbon footprint.

Responsibilities

Responsible for requirement gathering from different Business users as well as application users.

Responsible for creating specifications covering the functional and technical design of different reports.

Worked on development, testing and rollout of various modules.

Project Green Mountain Energy

Operating System Windows OS

Special Software PHP, Joomla 1.5 Framework, MySQL

Pune

Period September 2008 – January 2010

Client Name City of Corpus Christi, Texas

Position Report Developer

Project Description

The City of Corpus Christi is a Texas city government providing water services, a health department, and other commonly used services. This project involved report development using Actuate.

Responsibilities

Responsible for requirement gathering from different Business users as well as application users

Responsible for creating specifications covering the functional and technical design of different reports.

Worked on development, testing and rollout of various reports

Involved in development of Actuate Reports.

Developed various reports including master detail, Cross-Tab, Sub-reports, Sequential and Parallel.

Used various features of Actuate like Data Filters, Single Input Filter, Memory Data Sorter, Dynamic Frames and Controls.

Project City of Corpus Christi

Operating System Windows OS

Special Software Actuate e Report Design Professional, iServer, Oracle 9i

Pune

Period September 2008 – January 2010

Client Name Mosaic, Bartow, FL (Phase II)

Position Report Developer

Project Description

Mosaic is one of the world's leading producers and marketers of concentrated phosphate and potash crop nutrients. The company is a source of phosphates, potash, nitrogen fertilizers, and feed ingredients.

Responsibilities

Responsible for requirement gathering from different Business users as well as application users

Responsible for creating specifications covering the functional and technical design of different reports.

Worked on development, testing and rollout of various reports

Involved in development of Actuate Reports.

Developed various reports including master detail, Cross-Tab, Sub-reports, Sequential and Parallel.

Used various features of Actuate like Data Filters, Single Input Filter, Memory Data Sorter, Dynamic Frames and Controls.

Project Mosaic (Phase II)

Operating System Windows OS

Special Software Actuate e Report Design Professional, iServer, Oracle 9i

Pune

Personal Details

Date of Birth March 12, 1986

Sex Male

Nationality Indian

Date of Joining July 13, 2011

Designation Assistant Consultant

Location Pittsburgh, PA 15220


