SUMMARY
•Data Analyst with Around * years of experience in Data Analysis, Machine Learning, Data mining with large data sets of Structured and Unstructured data, Data Acquisition, Data Validation, Predictive Modeling, Data Visualization, and Web Scraping.
•Experience using various R and Python packages, ggplot2, pandas, NumPy, Seaborn, SciPy, Matplotlib, sci-kit-learn, Beautiful Soup.
•Proficient in Power BI, Tableau, Qlik, and R-Shiny data visualization tools to analyze and obtain insights into large datasets and to create visually powerful and actionable interactive reports and dashboards.
•Expertise in transforming business requirements into analytical models, designing algorithms, building models, and developing data mining and reporting solutions that scale across a massive volume of structured and unstructured data.
•Experience with Data Analytics, Data Reporting, Ad-hoc Reporting, Graphs, Scales, PivotTables and OLAP reporting.
•Extensive experience with OLTP/OLAP System and E-R Modelling, developing Database Schemas like Star schema and Snowflake schema used in relational, dimensional, and multidimensional Modelling.
•Expertise in handling and optimizing SQL queries in Oracle, SQL Server, MS Access, and Teradata.
•Good industry knowledge, analytical &problem-solving skills, and ability to work well within a team and as an individual.
SKILLS
Methodologies: SDLC, Agile, Waterfall
Language: Python, R, SQL, SAS
IDEs: Visual Studio Code, PyCharm, Jupiter Notebook
Packages: NumPy, Pandas, Matplotlib, SciPy, ggplot2, Scikit-Learn, PyTorch, TensorFlow, Keras, Spark
Data Processing/Techniques: Data Analysis, Data Visualization, Data Analytics, Data Modeling, Data Entry, Data Mining, ETL, Data Management
Visualization Tools: Tableau, Qlik, Power BI, Microsoft Excel Cloud Technologies: AWS, GCP
Database: MySQL, SQL Server, Oracle, MongoDB
Concepts/Fields: Statistics, Business Intelligence, Data Science, Statistical Analysis, Analytics
Software / Other Skills: Jira, Data Cleaning, Data Wrangling, Critical Thinking, Communication Skills, Presentation Skills, Problem-solving, Decision Making, Google Sheets, Information Management, Google App Script, Microsoft Office, Snowflake Operating System: Windows, Linux
EDUCATION
Master in Electrical Engineering (Machine Learning) San Jose State University, San Jose, CA
Bachelor in Instrumentation Engineering Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra
WORK EXPERIENCE
Centene, USA Data Analyst Jan 2023-Current
•Worked in an Agile environment where we had a daily standup, weekly sprints, and defining test scenarios and strategies.
•Designed an A/B experiment to test the business performance of the new recommendation system.
•Used packages like ggplot2 in R Studio for data visualization and low graphs to identify the relationship between variables.
•Created real-time dashboards in Tableau to visualize and monitor key metrics and A/B test processing using external and internal data.
•Involved in Data flow analysis, Data modeling, Physical database design, forms design and development, performance analysis, and tuning.
•Developed Python modules for machine learning and predictive analytics on AWS.
•Performed Data mapping and logical data modeling, created class diagrams and ER diagrams, and used SQL queries to filter data within the Oracle database.
•Worked closely with the data stewards and architects to ensure correct and related data was captured in the data warehouse as part of the data quality check.
San Jose State University Research Foundation Apr 2022 – Aug 2022
•Constructed deep learning LSTM architecture and recurrent neural network with MPC to accurately forecast vagus nerve stimulation efficacy, enhancing prediction accuracy by 10%.
•Analyzed large datasets of physiological data using MATLAB to identify patterns and relationships to inform the development of ML models.
•Leveraged rich insights from training to accomplish a remarkable 15% increase in out-of-sample predictive performance, boosting the prediction rate and increasing the accuracy of the model.
•Boosted productivity by 20%, meeting project deadlines through time management, and enhancing research efficiency.
Groovy Web, India Data Analyst April 2019 - July 2021
•Involved in developing campaign waterfall reports per the business requirements to facilitate highly accurate decisions.
•Used Pandas as an API to put the data in a time series and tabular format for data manipulation and retrieval.
•Worked on AWS Data Pipeline to configure data loads from S3 to Redshift.
•Established Databricks ETL pipelines using notebooks, Spark Data frames, SPARK SQL, and Python scripting.
•Developed complex SQL queries using stored procedures, common table expressions (CTEs), and temporary tables to support Power BI reports.
•Extensively designed Data mapping and filtering, consolidation, cleansing, Integration, ETL, and customization of data mart.
•Designed and maintained MySQL databases, created pipelines using user-defined functions, and stored procedures for daily reporting tasks.
•Analyzed complex datasets using statistical models and machine learning algorithms, resulting in a 15% increase in customer retention rates.
•Experienced Data Analyst with an understanding of Data Mapping, Data warehousing (OLTP, OLAP), Data Mining, Data Governance, and Data management services with Quality Assurance.