Post Job Free
Sign in

data analyst

Location:
United States
Posted:
June 07, 2023

Contact this candidate

Resume:

Rishi

Sr. Data Analyst

Email: ********************@*****.***

Ph: 469-***-****

Professional Summary:

Professional qualified Data Analyst with over 7+ years of experience in Data Science and Analytics including Machine Learning, Data Mining and Statistical Analysis

Involved in the entire data science project life cycle and actively involved in all the phases including data extraction, data cleaning, statistical modeling and data visualization with large data sets of structured and unstructured data

Experienced with machine learning algorithm such as logistic regression, random forest, XGboost, KNN, SVM, neural network, linear regression, lasso regression and k-means

Implemented Bagging and Boosting to enhance the model performance.

Experience in implementing data analysis with various analytic tools, such as Anaconda 4.0JupiterNotebook 4.X, R 3.0 (ggplot2, Caret, dplyr) and Excel

Solid ability to write and optimize diverse SQL queries, working knowledge of RDBMS like SQLServer2008, NoSQL databases like MongoDB3.2

Experience in visualization tools like, Tableau9.X, 10.X for creating dashboards

Solid understanding of statistical software packages like SAS, STATA

Excellent understanding Agile and Scrum development methodology

Used the version control tools like Git 2.X

Passionate about gleaning insightful information from massive data assets and developing a culture of sound, data-driven decision making

Ability to maintain a fun, casual, professional and productive team atmosphere

Experienced the full software life cycle in SDLC, Agile and Scrum methodologies.

Skilled in Advanced Regression Modeling, Correlation, Multivariate Analysis, Model Building, Business Intelligence tools and application of Statistical Concepts.

Experience with querying on data present in Cassandra cluster using CQL (Cassandra Query Language).

Proficient in Predictive Modeling, Data Mining Methods, Factor Analysis, ANOVA, Hypothetical testing, normal distribution and other advanced statistical and econometric techniques.

Developed predictive models using Decision Tree, Random Forest, Naïve Bayes, Logistic Regression, Cluster Analysis, and Neural Networks.

Experienced in Machine Learning and Statistical Analysis with Python Scikit-Learn.

Experienced in Python to manipulate data for data loading and extraction and worked with python libraries like Matplotlib, NumPy, SciPy and Pandas for data analysis.

Worked with complex applications such as R, SAS, MATLAB and SPSS to develop neural network, cluster analysis.

Expertise in transforming business requirements into analytical models, designing algorithms, building models, developing data mining and reporting solutions that scales across massive volume of structured and unstructured data.

Skilled in performing data parsing, data manipulation and data preparation with methods including describe data contents, compute descriptive statistics of data, regex, split and combine, Remap, merge, subset, reindex, melt and reshape.

Strong SQL programming skills, with experience in working with functions, packages and triggers.

Experienced in Visual Basic for Applications and VB programming languages to work with developing applications.

Worked with NoSQL Database including HBase, Cassandra and MongoDB.

Experienced in Data Integration Validation and Data Quality controls for ETL process and Data Warehousing using MS Visual Studio SSIS, SSAS, SSRS.

Proficient in Tableau and R-Shiny data visualization tools to analyze and obtain insights into large datasets, create visually powerful and actionable interactive reports and dashboards.

Automated recurring reports using SQL and Python and visualized them on BI platform like Tableau.

Worked in development environment like Git and VM.

Excellent communication skills. Successfully working in fast-paced multitasking environment both independently and in collaborative team, a self-motivated enthusiastic learner.

Technical Skills:

Web Technologies JDBC, HTML5, DHTML and XML, CSS3, Web Services, WSDL

Tools Erwin r 9.6, 9.5, 9.1, 8.x, Rational Rose, ER/Studio, MS Visio, SAP Power designer.

Databases SQL, Hive, T-SQL, Impala, Pig, Spark SQL, Databases SQL-Server, My SQL, MS Access, HDFS, HBase, Teradata, Netezza, Mongo DB, Cassandra.

Reporting Tools MS Office (Word/Excel/Power Point/ Visio), Tableau, Crystal reports XI, Business Intelligence, SSRS, Business Objects 5.x/ 6.x, Cognos7.0/6.0.

ETL Tools Informatica Power Centre, SSIS.

Version Control Tools SVM, GitHub.

BI Tools Tableau, Tableau server, Tableau Reader, SAP Business Objects, OBIEE, QlikView, SAP Business Intelligence, Amazon Redshift, or Azure Data Warehouse

Operating System Windows, Linux, Unix, Red Hat.

Professional Experience:

Client: State Farm, Bloomington, Illinois May 2022 – Till Date

Sr Data Analyst

Responsibilities:

Responsible for gathering requirements from business analysts and operational analysts and identifying the data sources required for the reports.

Assisted the Data Analytics team with executing, validating, and assembling organized analytics to meet compliance and to create value out of financial data that drives business.

Worked with Change Management Team and Application Development team to deploy codes in QA, UAT, TRN and PROD environments.

Provided end-to-end solutions from exploratory data analysis, model development and ETL using Python to finished dashboard/reports using MicroStrategy

Developed logic and key performance metrics and managed improvement projects to enhance business intelligence reporting tool, QlikView

Designed and implemented dashboards and ad-hoc Excel reports for billing, accounting, accounts receivable, inventory, purchasing and sales departments

Used Python programs automated the process of combining the large SAS datasets and Data files and then converting as Teradata tables for Data Analysis.

Developed Python programs for manipulating the data read from Teradata data sources and convert them as CSV files.

Worked on numerous ad-hoc data pulls for business analysis and monitoring.

Designed and developed various monthly and quarterly business monitoring excel reports by writing Teradata SQL and using in MS Excel pivot tables.

Performed in depth analysis on data and prepared ad-hoc reports in MS Excel and SQL scripts.

Performed verification and validation for accuracy of data in the monthly/quarterly reports.

Developed Teradata SQL scripts using various characters, numeric and date functions.

Created multi-set tables and volatile tables using existing tables and collected statistics on table to improve the performance.

Extract up to date accounts data from Teradata database by using SQL, BTEQ.

Developed the Interfaces in SQL, for data calculations and data manipulations.

Used MS Excel and Teradata for data pools and adhocs reports for business analysis

Performed in depth analysis in data & prepared weekly, biweekly, monthly reports by using SQL, MS Excel and UNIX.

Experience in automation scripting using shell and Python.

Work with cross functional teams planning, modeling, and implementing solutions utilizing NoSQL technologies.

Automated Windows SAS Scripts on UNIX SAS platform Used Python programs to automate the process of combining the large SAS datasets and Data files and then converting as Teradata tables for Data Analysis.

Designed stunning visualization dashboards using tableau desktop and publishing dashboards on tableau server and desktop reader.

Developed programs with manipulate arrays using libraries like NumPy and Python

Environment: Teradata, SQL Assistant, MS office,NoSQL, MS Excel, Agile, Windows, UNIX, SAS EG, SQLPutty.

Client: Cigna, Bloomfield, Connecticut Oct 2021 – Apr 2022

Sr Data Analyst

Responsibilities:

Responsible for Developing and implementing methods of data collection and analysis for use in setting business priorities and strategies.

Constantly working with IT to Identify and further develop reporting capability by migrating the data to cloud, (Cloud Datawarehouse) including exploring new software, to refine ongoing assessment of changes made.

Work with marketing department to provide marketing analytics by creating data visualizations on DOMO- for Marketing Budget, Marketing KPI’s, ROI, Offline Marketing, Marketing Events etc.

Developed Interactive dashboards in PowerBI to track and analyze KPI’s on website that get approximately 500k visitors quarterly

Extracted data from SQL and prepared data for exploratory analysis using NumPy & Pandas.

Participated in all phases of data mining, data cleaning, data collection, developing models, validation, and visualization and performed Gap analysis.

Worked on different data formats such as JSON, SQL, CSV & Excel Datasets and performed machine learning algorithms in Python.

Using Spark shell for interactive data analysis and process using Spark Sql to query structured data

Programmed a utility in Python that used multiple packages (NumPy, SciPy, pandas).

Implemented Classification using supervised algorithms like Logistic Regression, Decision trees, Naive Bayes, KNN.

Conducted K-Means clustering algorithm for segmentation of various customers.

Performed time series analysis for forecasting of sales for next fiscal year using ARIMA based modeling.

Data transformation from various resources, data organization, features extraction, feature engineering and feature pre-processing from data.

Developed Spark applications using Spark - SQL in Azure Databricks for data extraction, transformation and aggregation from multiple file formats for analyzing & transforming the data to uncover insights into the customer usage patterns.

Conducted exploratory data analysis using descriptive statistics & inferential statistics.

Build statistical plots using seaborn and matplotlib to perform the distribution analysis such as CDF, PDF, PMF as well as regression plot and boxplot etc. to get insights.

Performed feature pre-processing such as missing value imputation, outlier treatment, scaling the data and creating dummy variables for machine learning algorithms.

Client: Medtronic, Minneapolis, Minnesota Apr 2020 – Oct 2021

Sr Data Analyst

Responsibilities:

Worked with business analysts, senior project managers, and programmers to gather business requirements and specifications.

As part of this team, I was involved in Retrieve data from Database and developing reports based on the business requirements.

Transformed data from one server to other servers using tools like Bulk Copy Program (BCP), and SQL Server Integration Services (SSIS)

Generating the reports for Teradata and Oracle Database to analyze the customer behavior and plan strategies to improve the response rates of marketing Campaigns.

Created Numerous Volatile, Set, Multiset, Derived, and Global temp tables.

Extensively used joins while extracting data from multiple tables.

Developed Teradata SQL scripts using various characters, numeric and date functions.

Develop / Auto deploy content using AWS (Amazon Web Services), GIT/Bitbucket, Maven, Jenkins.

Create new EC2 instance in AWS, allocate volumes and giving Provisionals using IAM

Improved performance while using performance tuning techniques like primary Index and collection statistics on index columns.

Hands on Experience in Writing Python Scripts for Data Extract and Data Transfer.

Strong Experience in writing Python programs for manipulating the data reading from various Teradata and convert them as one CSV Files.

Strong experience in using Excel & MS Access to dump the data & analyze based on business needs.

Worked with Excel Pivot Tables for various business scenarios

Performed Verification, Validation and Transformations on the Input data (Text files) before loading into target database.

Designed and developed weekly, monthly reports by using MS Excel Techniques (Charts, Graphs, Pivot tables) and Power point presentations.

Used MS Excel and Teradata for data pools and adhocs reports for business analysis

Extensively used Base SAS programs to convert data into Teradata table into CSV and other flat files.

Responsible for analyzing business requirements and developing Reports using PowerPoint, Excel to provide data analysis solutions to business clients.

Client: Ingredion, Bridgewater, NJ Feb 2019 – Mar 2020

Client: Data Analyst

Responsibilities:

Participated in all phases of research including data collection, data cleaning, data mining, developing models and visualizations.

Collaborated with data engineers and operation team to collect data from internal system to fit the analytical requirements.

Worked on migrating existing mainframe data and reporting feeds to Hadoop.

Redefined many attributes and relationships and cleansed unwanted tables/columns using SQL queries.

Integrated Hadoop with Tableau and SAS analytics to provide end users analytical reports

Worked with AWS Cloud platform and its features which includes EC2, VPC, RDS, EBS, S3, CloudWatch, Cloud Trail, CloudFormation and Autoscaling etc.

Used AWS command line client and management console to interact with AWS resources and APIs.

Created data models for customers data using Cassandra Query Language (CQL)

Managing large-scale, geographically-distributed database systems, including relational (Oracle, SQL server) and NoSQL (MongoDB, Cassandra) systems.

Utilized Spark SQL API in PySpark to extract and load data and perform SQL queries.

Performed data imputation using Scikit-learn package in Python.

Performed data processing using Python libraries like NumPy and Pandas.

Worked with data analysis using ggplot2 library in R to do data visualizations for better understanding of customers' behaviors.

Visually plotted data using Tableau for dashboards and reports.

Implemented statistical modeling with XGboost machine learning software package using R to determine the predicted probabilities of each model.

Delivered the results with operation team for better decisions.

Environment: Python, R, SQL, Tableau, Spark, SAS, Stata Machine Learning Software Package, recommendation systems.

Client: Infinite Computer Solutions India Ltd., India Jul 2014 – Dec 2017

Role: Data Analyst

Responsibilities:

Interacted with Business Users for gathering, analyzing, and documenting business requirements and data specifications.

Worked on numerous ad-hoc data pulls for business analysis and monitoring.

Developed spark applications using Scala for easy Hadoop transitions.

Designed and developed various monthly and quarterly business monitoring excel reports by writing Teradata SQL and using in MS Excel pivot tables.

Performed in depth analysis on data and prepared ad-hoc reports in MS Excel and SQL scripts.

Performed verification and validation for accuracy of data in the monthly/quarterly reports.

Developed Teradata SQL scripts using various characters, numeric and date functions.

Created multi-set tables and volatile tables using existing tables and collected statistics on table to improve the performance.

Analyze data using statistical software (Stata) and maintain database files.

Developed Teradata SQL scripts using OLAP functions like rank and rank Over to improve the query performance while pulling the data from large tables.

Utilized ODBC connectivity to connect Teradata from MS Excel to automate the data pull and refresh graphs for weekly and monthly reports.

Experience in performing Dual Data Validation on various Businesses critical reports working with another Analyst.

Developed UNIX shell scripts to run batch jobs and communicate logs to the users.

Automated Teradata SQL scripts in UNIX by using shell scripting.

Developed SAS programs to import the data from Mainframe text file into Teradata Tables.

Wrote SAS Programs to convert Excel data into Teradata tables.

Experience in converting the SQL scripts into SAS SQL scripts.

Used Python programs to automate the process of combining the large SAS datasets and Data files and then converting as Teradata tables for Data Analysis.

Developed Python programs for manipulating the data reading from various Teradata and convert them as one CSV Files.

Developed programs with manipulate arrays using libraries like NumPy and Python.

Education

Bachelors in Electronics and Computer Science from KL University, India.



Contact this candidate