Post Job Free
Sign in

data analyst

Location:
Tempe, AZ
Salary:
170000
Posted:
June 19, 2024

Contact this candidate

Resume:

THARUN KAMMILI

**************@*****.*** +1-623-***-**** Tempe, AZ, 85281

PROFESSIONAL SUMMARY

Over 9 years working experience as Data Analyst with solid understanding of Data Modeling, Evaluating Data Sources, feature building and strong understanding of ETL, BI.

Experience in designing Software Development Life Cycle (SDLC) with good working knowledge of testing methodologies, disciplines, tasks, resources, and scheduling.

Experience in Data Analysis, Data Validation, Data Cleansing, Data Verification and identifying data mismatches.

Ability to collaborate with peers in both business, and technical areas, to deliver optimal business process solutions, in line with corporate priorities. Good experience in banking domain.

Experience in various phases of Software Development life cycle (Analysis, Requirements gathering, Designing) with expertise in documenting various requirement specifications, functional specifications, Test Plans, Source to Target mappings, SQL joins.

Expertise in building data models, migration plan and project completion using Snowflake.

Worked with BI/Reporting tools Tableau, Crystal Reports and Quick Sight having created various visualizations using Bar/Line/Pie charts, Scatter plots, Heat Maps and client reports.

Experienced working with Excel Pivot and VBA macros for various business scenarios.

Excellent knowledge on creating reports on SAP Business Objects, Webi reports for multiple data providers.

Experience in AWS cloud services like EC2, S3.

Hands on experience on various ServiceNow modules like service catalog, incident, change,

problem, reporting etc..

Experience in designing of online transactional processing (OLTP), operational data store and decision support system (DSS) (e.g. Data Warehouse) databases, utilizing data vault (hub and spoke), dimensional and normalized data designs as appropriate for enterprise-wide solutions.

Experience in working with EC2, EMR, Lambda and S3 bucket in AWS using cloud formation template.

Excellent knowledge in preparing required project documentation and tracking and reporting regularly on the status of projects to all project stakeholders.

Remarkable knowledge of design, Normalization and Database Management Concepts.

Experience in MongoDB large scale database systems.

Experience in Python data manipulation for loading and extraxction as well as with Python libraries such as NumPy, Pandas and Spark.

Hands on experience in MDX Expressions, DAX Expressions, Power Bi, Power Pivot, Power integrated with Share Point and in creating dashboards in Power Bi and Tableau visualizations tools.

Excellent experience in Data mining with querying and mining large datasets to discover transition patterns and examine financial data.

Experience in testing Business Intelligence reports generated by various BI Tools like Tableau, Cognos and Business Objects

Extensive knowledge and experience in producing tables, reports, graphs and listings using various procedures and handling large databases to perform complex data manipulations.

Extensive ETL testing experience using Informatica 8.6.1/8.1 (Power Center/ Power Mart) (Designer, Workflow Manager, Workflow Monitor and Server Manager).

Excellent knowledge on creating DML statements for underlying statements.

Mainframes Cobol, CICS, JCL, data analysis, program and system documentation visio.

PROFESSIONAL EXPERIENCE

Sr. Data Analyst – First Republic Bank, California, LA. (June 2021 - Current)

Worked on multiple projects to streamline, automate and optimize internal processes that improved overall efficiency and reporting of the company.

Responsibilities:

Collaborated with Business teams to provide consulting support and understand their process and areas of improvement in the current infrastructure.

Analysed and design the overall structure of the data pipeline and ETL process and highlighted refinements such as change in file formats and code changes.

Worked on snowflake by using SQL in extracting data and transforming it into GL reports.

Designed Business requirement documents which encompasses every requirement by the client and implementation plan.

Maintaining tracker in excel and creating daily reports in pivot tables for business on daily bases and updating stakeholders for any data issues.

Designed and planned upcoming ETL projects which includes ER diagram, overall infrastructure and Data pipeline.

Brokered business teams and technical teams to overcome technical challenges and reconcile business requirements.

Customized Splunk for monitoring application management and security as per requirements.

Ensured cross system communication and handshakes are properly done and logged.

Consistent checks and cross validation done during projects to ensure error free processes in terms of business and technical errors.

Created internal and external stage and transformed data during load. Redesigned the Views in snowflake to increase the performance. Unit tested the data between Redshift and Snowflake.

Experience in working with EC2, EMR, Lambda and S3 bucket and Quick Sight in AWS using cloud formation template.

Performed data cleaning, features scaling, features engineering using Pandas and NumPy packages in Python.

Understanding of statistical concepts and performed various testing which includes regression and end to end testing.

Planned QA/QC test cases and checks to be undergone once the project goes into testing phase and performed various ad-hoc analysis from processed data and creating comprehensive reports for end users.

Used Splunk for analysing logs generated by various operating systems.

Used data vault technique and achieved many advantages of data vault approach some of them are simplified the data ingestion process, removed the cleansing requirement of a star schema process and easily allowed the addition of new data sources without disruption to existing schema.

Data vault used in both a data loading technique and methodology which accommodates historical data, auditing and tracking of data.

Migrated various data sources to AWS S3 and scheduled ETL jobs 8sing AWS GLUE to build tables in AWS.

Design and developed insights reports on AWS Quicksight for client deliverable.

Worked on Tableau for creating dashboards interactive views, trends and drill downs using action filters.

Involved in reviewing business requirements and analyzing data sources form Google Drive,

Google Sheets and designing prototype visualizations

Used confluence to document the project and for release notes with Product Owner and senior management and updated status whenever major changes are incorporated.

Streamlined Issue tracker using JIRA (document Epics and User Stories, decomposed Functional Requirements into User Stories) and highlighted primary point of contact to raise the risks/issues around test cases development and post live issues.

Updated the stakeholders regularly about project timelines, ongoing issues and overall project status to get their view, recommendations on the project.

Created internal and external stage and transformed data during load. Redesigned the Views in snowflake to increase the performance. Unit tested the data between Redshift and Snowflake.

Environment: SQL, Excel, Python, AWS, Stored Procedures, Snowflake, Shell Scripts, Visio, Delimited files, Oracle 10G etc.

Sr. Data Analyst - Ebay, Austin, TX (August 2020 – April 2021)

Responsibilities:

Gather and analyze the business requirements and then translate them to technical specifications.

Work on Google sheets/GSuite/ Tableau to prepare data/Reports for business. Worked on MongoDB database concepts such as locking indexes, sharding, replication schema design etc..

Experience in identifying critical information in process automation, supply chain analytics, building vendor relationships, manufacturing losses reduction, and building claims forecasting models.

Create data quality scripts using SQL and Hive to validate successful data load and quality of the data create various types of data visualizations using Python and Tableau.

Working with data ingestions from multiple sources into Azure SQL data warehouse.

Involved in reviewing business requirements and analyzing data sources form Google Drive,

Google Sheets and designing prototype visualizations.

Implement and test the model on AWS EC2; collaborated with development team to get the best algorithm and parameters.

Design and develop insights reports on AWS Quick sight for client deliverable.

Developed interactive dashboards using Tableau for the supply chain for reporting and analytics team to monitor operational KPIs.

Designed Tableau and Power Bi dashboards which gives a visual representation for the users on visual analytics.

Developed data visualizations and dashboards using Power Bi and Tableau.

Performed end-to-end Data Analysis and ensured the data quality gaps are identified.

Used Data Blending, groups, calculated fields, and aggregated fields to compare and analyze data in different perspectives.

Created Relationships, actions, data blending, filters, parameters, hierarchies, Level-of- Detail (LOD), calculated fields, sorting, groupings, live connections, and in-memory in both tableau and excel.

Identify services and initiate implementation rules for the service catalog in ServiceNow.

Used splunk for analyzing logs generated by various operating systems.

Involved in administration tasks such as publishing workbooks, setting permissions, managing ownerships, providing access to the users and adding them to the specific group and scheduled instances for reports in Tableau Server.

Environment: SQL, PowerBI, Tableau, Python, Servicenow, MongoDB, AWS, MS Office Suite, Visio, Windows XP.

Sr. Data Analyst - Genworth, Virginia. (March 2020 – July 2020)

Analysis of functional and non-functional categorized data elements for data profiling and mapping from source to target data environment. Developed working documents to support findings and assign specific tasks.

Experienced in implementing Spark RDD transformations, actions to implement business analysis.

Performing CRUD operations like read, update, insert and delete records in MongoDB.

Worked with data investigation, discovery and mapping tools to scan every single data record from many sources.

Performed data analysis and data profiling using complex SQL on various sources systems including Oracle and Teradata.

Extensively used ETL methodology for supporting data extraction, transformations and loading processing, in a complex EDW using Informatica.

Tracked program development progress in Atlassian Jira, involved in User Acceptance Testing and supported in SDLC documentation with technical implications.

Used Atlassian JIRA to document Epics and User Stories, decomposed Functional Requirements into User Stories, conducted Storyboard reviews to senior management and worked with Agile team for Iterative Development.

Created Powerbi scorecards, dashboards using stack bars, bar graphs, scattered plots, geographical maps, Gantt charts using show me functionality.

Written several shell scripts using UNIX Korn shell for file transfers, error logging, data archiving, checking the log files and cleanup process.

Implemented Spark using Scala and Spark SQL for faster testing and processing of data.

Performing data management projects and fulfilling ad-hoc requests according to user specifications by utilizing data management software programs and tools like Perl, Toad, MS Access, Excel and SQL.

Written SQL scripts to test the mappings and Developed Traceability Matrix of Business Requirements mapped to Test Scripts to ensure any Change Control in requirements leads to test case update.

Involved in extensive DATA validation by writing several complex SQL queries and Involved in back-end testing and worked with data quality issues.

Implemented Spark using Scala and Spark SQL for faster testing and processing of data.

Assisted in defining business requirements for the IT team and created BRD and functional specifications documents along with mapping documents to assist the developers in their coding.

Worked on various visualizations techniques on Tableau like scatter plot, Histogram and heat maps.

Created different tabular reports using Power Bi features and enhanced them based on user requirements.

Performed data cleaning, features scaling, features engineering using Pandas and NumPy packages in Python.

Updated Python scripts to match the traning data with our database stored in AWS cloud search, so that we would be able to assign each document a response lable for further classifaction.

Experience in servicenow in customization of modules, CMDB policies, Discovery.

Designed and developed database models for the operational data store, data warehouse, and federated databases to support client enterprise Information Management Strategy.

Flexible to work late hours to coordinate with offshore team.

Environment: MS SQL Server 2008 client & SERVER, MS office, Legacy - Mainframes, Titanium, Rational Clear Quest, Clear Case., Servicenow, Python, Tableau, AWS.

Data Analyst – Allstate Insurance, Northfield IL (Nov 2018 – Dec 2019)

Responsibilities:

Involved in Data mapping specifications to create and execute detailed system test plans. The data mapping specifies what data will be extracted from an internal data warehouse, transformed and sent to an external entity.

Analyzed business requirements, system requirements, data mapping requirement specifications, and responsible for documenting functional requirements and supplementary requirements in Quality Center.

Setting up of environments to be used for testing and the range of functionalities to be tested as per technical specifications.

Tested Complex ETL Mappings and Sessions based on business user requirements and business rules to load data from source flat files and RDBMS tables to target tables.

Performed data mining on Claim’s data using very complex SQL queries and discovered Health care claims pattern.

Responsible for different Data mapping activities from Source systems to Teradata Created the test environment for Staging area, loading the Staging area with data from multiple sources.

Responsible for analyzing various data sources such as flat files, ASCII Data, EBCDIC Data, Relational Data (Oracle, DB2 UDB, MS SQL Server) from various heterogeneous data sources.

Delivered file in various file formatting system (ex. Excel file, Tab delimited text, Coma separated text, Pipe delimited text etc.)

Executed campaign based on customer requirements.

Designed ODS and data vault with expertise in loan and all types of cards.

Performed ad hoc analyses, as needed, with the ability to comprehend analysis as needed.

Involved in testing the XML files and checked whether data is parsed and loaded to staging tables.

Responsible for creating test cases to make sure the data originating from source is made into target properly in the right format.

Tested several stored procedures and wrote complex SQL syntax using case, having, connect by etc.

Involved in Teradata SQL Development, Unit Testing and Performance Tuning and to ensure testing issues are resolved based on using defect reports.

Tested the ETL process for both before data validation and after data validation process.

Tested the messages published by ETL tool and data loaded into various databases.

Experience in creating UNIX scripts for file transfer and file manipulation.

Provide support to client with assessing how many virtual user licenses would be needed for performance testing.

Ensuring onsite to offshore transition, QA Processes, and closure of problems & issues.

Tested the database to check field size validation, check constraints, stored procedures, and cross verifying the field size defined within the application with metadata.

Environment: SQL, Python, Tableau, Informatica 8.1, Data Flux, Oracle 9i, Quality Center 8.2,

SQL, TOAD, PL/SQL, Flat Files, Teradata.

Data Analyst - Adequare, Hyderabad (Oct 2014 – Jun 2018)

Responsibilities:

Involved in gathering data from a different team, along with business intelligence team to provide reports, coordinating the deliverables, gathering and documenting the requirements.

Built S3 buckets and managed policies for S3 buckets and used S3 bucket and Glacier for storage and backup on AWS.

Used ETL to develop jobs for extracting, cleaning, transforming and loading the data from various sources.

Experienced Python in data wrangling, cleansing, preparation for analysis.

Performed in-depth analysis of data and prepared daily reports by using SQL, MS Excel, Share Point.

Experienced in translating requirements into actionable reports and providing consulting support to clients that is data-based, analysis-driven, and a strong understanding of customer relationship management.

Performed various ad-hoc analysis by extracting data from multiple source systems and creating comprehensive reports for end users.

Managed Servers on the Amazon Web Services (AWS) platform instances using Puppet configuration management.

Define data needs, evaluate data quality, and extract/transform data for analytic projects and research.

Analyzing data and prototype models for targeting and personalization, work on analytical or experimental requirements to devise data solutions.

Effectively communicate and document technical analyses and results.

Involved closely with Marketing and Operations team to understand/define requirements, domain knowledge/models, and data needs.

Utilize Power Bi to design multiple score cards and dashboards to display information required by different departments and upper level management.

Ensure analysis and solutions drive business decisions.

Developed a solution which will aid in the data capture, data cleansing, data monitoring and reporting of customer data.

Helped define key business problems to be solved; analyze data to solve those problems.

Generated Heat maps to identify the risk and flaws in the business.

Validated Data to check for the proper conversion of the data. Data cleansing to identify unnecessary data and clean, data profiling for accuracy, completeness, consistency.

Assisted and produced standard reports, charts, graphs and tables from a structured data source by querying data repositories using Python and SQL.

Developed and produced a dashboard, key performance indicators and monitor organization performance.

Environment: Python, SQL, Jupyter, NumPy, Tableau, Power Bi, MS Office Suite, Visio, PowerShell, Windows XP.

TECHNICAL SKILLS (LANGUAGES/TOOLS/FRAMEWORKS/CONCEPTS)

Databases: Snowflake, Data Warehousing Informatica 9.1/8.6/7.1.2 (Repository Manager, Designer, Workflow Manager, and Workflow Monitor), dimension tables, Pivot Tables, Erwin. Oracle 9i, 10g,11g, MS SQL Server, MS Access, Teradata.

BI Tools: Cognos, Tableau, Quicksight, Framework Manager, ETL (Data Stage), ESRI Maps, Query

Studio, Analysis Studio, Event Studio, Brio Tool, Metric Studio, Cognos Administration, Access Manager, Splunk, Data Stage, Data Studio, WinSCP, Powerbi

Testing Tools Win Runner, Load Runner, Test Director, Mercury Quality Center, Rational Clear Quest

ETL Tools: Ethority, Informatica Power Center, SQL Loader Data Stage, Informatica Data Quality, SSIS.

Version Control Tool: RDBMS Oracle 10g/9i/8i/7.x, MS SQL Server, UDB DB2 9.x, Teradata, MS Access 7.0

Programming Languages : Python(Polars, Pandas, Matplotlib, Scipy, Scikit Learn, Numpy), SQL(MS SQL Server, MySQL), R (dplyr, dpblyr, ggplot), Embedded C, MATLab, Latex, C++, Power BI, DAX, ETL, SAS, Pentaho, Dataiku, Informatica, Machine Learning & Statistics, Python (Scikit Learn, Pandas, NumPy, Matplotlib, Actimize, seaborn, plotty, cufflinks), Java, AWS, Hadoop, MS SQL Server, DB Visualizer, Oracle DB, Mongo DB, Unix, R programming, Netezza, CA TDM, Datamaker, Data Loader, Agile, Jira, Kanban, SSRS, SSIS, Jira, Visual Studio.

Technologies/ Frameworks : Pycharm, Jupiter, Tableau, JMP, Minitab, MS Office Suite, Visio, MS Project, Jira, Power Bi, Apache Airflow.

Business Concept’s : Product Lifecycle Management, Project Management, Product Supersession, Data Analysis, Lean Six Sigma

Competencies : Machine Learning, Project Planning, Supply Chain Planning

Certifications : Google Data Analytics, Google Project Management (Agile, Waterfall), Supply Chain Planning

ACADEMIC PROJECTS

• Spam Email Classifier Scikit-Learn, Python, Pandas, Numpy, Matplotlib

Built ML model cleaning raw data, encoding and comparing performances (Recall, F1, Accuracy, Precision) of Log. Reg., KNN, SVM, NB, DT, Random Forest, MLP using Grid Search CV and K-Folds selecting optimal parameters.

• Custom Object Detection Yolo V7, Python, Transfer Learning

Employed YOLO V7 object detection algorithms leveraging transfer learning to implement image recognition on a given custom car image dataset and predict bounding boxes with class probabilities of each object in image.

• Social Media Sentiment Analysis

Executed sentiment analysis on social media data using Power BI, incorporating Data Analysis Xpressions (DAX) for advanced calculations and metrics. Leveraged Power BI's visualization features and illustrated sentiment trends, providing valuable insights for data-driven decision-making in social media strategies.

• Sales Data Analysis Polars, Python, Jupyter

Exploratory Data Analysis of the Sales Data through multiple files. It uses python polars for the data manipulation.

• Pose Estimation

Replicated a method for estimating high accuracy, multi-human applicable 2D pose estimate with CNN architecture. The advanced Non-parametric representation method referred to here as PAFs was used.

• Library Database Management System MySql

Created Library Management database, translating an ER model to a relational one. Developed tables, views, and indexes using MySQL, enforced key constraints and normalization. Designed front-end forms with PHP for book operations and user accounts. Rigorously tested for security and efficiency, improving library ops.



Contact this candidate