Post Job Free

Resume

Sign in

Data Analyst Business Intelligence

Location:
Jersey City, NJ
Posted:
January 25, 2024

Contact this candidate

Resume:

Nishanth Sura

Data Analyst

Location: NY Phone: 713-***-**** Email: ad23i0@r.postjobfree.com

SUMMARY

Accumulated around 3 years of professional experience as a Data Analyst, demonstrating proficiency in various domains, including Database Management, Quantitative Analysis (Statistics), Data Mining, Text Mining, Optimization, Data Warehousing & Business Intelligence, Time Series, and reporting.

Exhibited expertise in Python libraries such as Matplotlib, Seaborn, and Plotly to craft compelling data visualizations, producing visually captivating charts and graphs.

Executed data analysis and statistical modeling using Python packages like NumPy, Pandas, and SciPy.

Employed Python scripts to execute advanced data cleansing, integrating regular expressions, custom functions, and statistical techniques to ensure data accuracy and uphold stringent data quality standards.

Demonstrated prowess in developing diverse Statistical, Text Analytics, and Data Mining solutions for various business challenges, accompanied by data visualizations using Python & Tableau. Facilitated the design and development of Data warehouses for Business Intelligence data marts and reporting.

Established and managed data dashboards and reports utilizing Tableau and Power BI, providing real-time visibility of business metrics and KPIs to stakeholders.

Orchestrated the design and maintenance of databases in MySQL, MongoDB, and PostgreSQL, guaranteeing data integrity, fortified security, and optimal performance.

Proficiently engineered end-to-end ETL processes, responsible for extracting data from heterogeneous sources, transforming it into meaningful insights, and loading it into target systems.

Harnessed SQL's querying capabilities to extract pertinent data, execute intricate joins, and apply transformations, enabling comprehensive data analysis.

Automated data loading procedures utilizing SSIS, systematically scheduled package execution, monitored performance, and architected multidimensional and tabular data models for OLAP cubes and data mining, leveraging SQL Server Analysis Services (SSAS).

Demonstrated competence in Agile methodologies, fostering seamless collaboration with cross-functional teams to deliver iterative and impactful data analysis solutions. Played an integral role in Scrum ceremonies, refining user stories, and meticulously planning sprints to ensure alignment with project objectives.

TECHNICAL SKILLS

Languages : Python, SQL, R

Visualization : Tableau, Power BI, MS Excel (VLOOKUPS, Pivot tables, Macros, VBA)

Big Data/Database : MySQL, MongoDB, PostgreSQL, SSIS, SSAS, ETL

Others : Data Warehousing, Data Visualization, Data Mining and Data Modeling

Platform/Library : NumPy, Pandas, NLTK, Matplotlib, Sklearn, SciPy, Seaborn, Plotly

Methodologies : SDLC, Agile, Waterfall, Scrum, Jira

Operating System : Linux, Windows, Mac

EDUCATION

Stevens Institute of Technology, Hoboken, NJ Dec 2022

Masters in Information Systems

R.V College of Engineering, Bangalore, India Jul 2019

Bachelors in Information Science Engineering

EXPERIENCE

Data Analyst Molina Health Care, NY Feb 2023 - Current

Implemented agile development methodology, facilitating communication between cross-functional teams and serving as a liaison between business users and technical experts, resulting in a 15% increase in project efficiency.

Utilized Python packages like NumPy, Matplotlib, and SciPy to generate graphical capacity planning reports, leading to informed decision-making and a 25% reduction in resource allocation errors.

Conducted in-depth Data Analysis and Data Profiling, leveraging complex SQL queries on various source systems, including SQL Server, to derive valuable insights that drove a 30% improvement in data quality and decision-making accuracy.

Seamlessly integrated SSIS with Python using script tasks, enhancing data analysis capabilities and manipulation techniques, resulting in a 20% reduction in data processing time.

Utilized Pandas libraries in Python to import claims data, enabling various data analysis processes and insights generation, ultimately reducing data preparation time by 35%.

Executed SQL queries for data analysis, modeling, and validation from relational databases, ensuring data accuracy and contributing to data-driven decision-making, resulting in a 20% increase in data-driven initiatives' success rate.

Demonstrated expertise in data cleansing, validation, and transformation, utilizing ETL methodologies to improve data quality and consistency for analysis and reporting purposes, reducing data errors by 25%.

Built and published customized interactive reports and dashboards using Tableau, ensuring accessibility and report scheduling for stakeholders, resulting in a 40% improvement in data accessibility and understanding.

Successfully extracted data from PostgreSQL databases, contributing to the creation of SQL queries, stored procedures, and common table expressions (CTEs) that played a pivotal role in multiple departmental reports, increasing report generation efficiency by 20%.

Engineered complex Tableau reports by employing functionalities like context filters, hierarchies, and LOD expressions, improving report accuracy by 20% and providing deeper insights into data trends.

Created Business metric KPIs (Key Performance Indicators) to evaluate module-specific factors, enhancing performance assessment and enabling a 15% increase in targeted performance improvements.

Data Analyst KPMG, India Mar 2019 – Feb 2021

Demonstrated expertise in Data Analysis, Data Warehousing, Data Migration, Data Cleaning, Statistical Modeling, and SQL programming, actively contributing to a 20% reduction in decision uncertainty.

Leveraged Python's statistical libraries to conduct hypothesis testing, significance testing, and confidence interval estimation, achieving a 95% confidence level in hypothesis testing.

Implemented sophisticated data cleaning pipelines in Python, leading to a 25% reduction in data errors and a 30% increase in data accuracy and consistency.

Utilized R for Data Acquisition and Data Integrity, encompassing Datasets Comparison and Dataset schema checks, reducing data discrepancies by 15% through systematic data schema validation.

Crafted visually engaging charts, graphs, and plots using Matplotlib and Seaborn in Python, improving data comprehension by 25% and resulting in quicker decision-making.

Imported the customer data into Python using Pandas libraries and performed various data analysis - found patterns in data which helped in key decisions for the company.

Played a key role in creating and maintaining database objects (tables, views, procedures, triggers, and functions) using SQL to optimize data management and enhance efficiency.

Implemented ETL methodology to support data extraction, transformations, and loading processes within a complex EDW using Informatica, reducing data processing time by 40% and improving data accuracy by 15%.

Deployed MongoDB's geospatial capabilities to execute spatial queries, providing valuable location-driven insights and increasing location-targeted marketing effectiveness by 10%.

Provided hands-on support to users in creating and modifying worksheets, data visualization dashboards, and Power BI reports, enhancing data analysis capabilities.

Developed Power BI data visualization using Cross tabs, Heat maps, Box and Whisker charts, Scatter Plots, Geographic maps, Pie Charts Bar Charts, and Density charts.

Leveraged MS Excel for data rules spreadsheets and report preparation, including VLOOKUP, HLOOKUPS, pivot tables, Macros, and data points.

Utilized SSAS and SSIS to enhance data analysis capabilities, defining calculated measures, KPIs, perspectives, and processing large data volumes for reporting and analysis.

PROJECTS

Health Insurance Lead Prediction

Planned and Supervised a four-person team in leads classification using Logical regression with the goal of finding potential clients to buy a health insurance policy.

Designed and deployed the classifier using a database of 50k+ clients with 15 attributes to Predict leads with a high accuracy

Visitor Database management system

Designed and deployed from scratch a full-stack application for a visitor management system

Supervised Development of database schema with a PHP frontend on top of a SQL backend

Spam Filter

Formulated and Implemented a spam filter for texts and emails with Natural Language Processing

Planned and Supervised a two-person team in message classification by using TF-IDF statistics on an N-gram model of Normalized, Stemmed, and tokenized data to efficiently increase accuracy

Movie Recommendation System

Programmed the Netflix problem based on user ratings with 5000 movies as training data

Systematized an amalgam of content-based as well as collaborative filtering/recommendation to find top optimal matches resulting in a high accuracy



Contact this candidate