Post Job Free

Resume

Sign in

Data Analyst

Location:
Clifton, NJ
Salary:
$45/hr
Posted:
September 23, 2019

Contact this candidate

Resume:

*+ years of professional experience in Data analytics & Visualization.

Working experience in Software Development life cycle (SDLC). Well versed with various types of Software Development methodologies- Waterfall, Agile, RUP, Iterative and extreme Programming.

Experience in Analytics - With expertise in Predictive and Prescriptive analytics and data mining techniques with emphasis on Machine Learning algorithms (Supervised and Unsupervised learning) using SAS and Python programming.

Involved in various projects related to Data Modelling, System/Data Analysis, Design and Development for both OLTP and Data warehousing environments, Data Extraction, Data Cleansing, Data Profiling, Data Mining, Data Consolidation, and Data Quality for various business data feeds.

Strong experience in UNIX shell scripting, writing UNIX wrapper scripts and monitoring the UNIX logs to check for any errors.

Worked with the UNIX shell scripts, Tested UNIX batch jobs according to the specifications and the functionality.

Wrote custom SQL queries to optimize the extracts, and heavily relied on data blending as the end workbooks ultimately required multiple fields from different data sources.

Experience in using statistical languages (SAS, Python) to Clean, Integrate, Transform, Reduce, Analysed and Interpret large sets of Data. Experienced in Python's modules NumPy, matplotlib, pandas etc for data pre-processing, web scraping, visualization and for machine learning.

Practical understanding of the Data modelling (Dimensional & Relational) concepts like Star-Schema Modelling, Snowflake Schema Modelling, Normalization/De-normalization, Fact and Dimension tables, Implemented Slowly Changing Dimensions - Type I & II in Dimension tables as per the requirements.

Extensive working experience in RDBMS technologies like Oracle, MS SQL Server, Excel and MYSQL, with good knowledge of SQL, TOAD, SQL Plus, Win SQL. Good at working with SQL Assistant in Teradata environment.

Ability to develop complicated SQL script for Data validation testing by running SQL script, procedures.

Established and maintained comprehensive data model documentation including detailed descriptions of business entities, attributes, and data relationships.

Developed dash boards with rich Graphic visualizations, Drill Down and Drop-down menu options, parameterized using Tableau reader.

Experience stunning visualizations using Tableau software and publishing and presenting dashboards, Storyline on web and desktop platforms.

Excellent written and oral communication skills and a team-player with a results-oriented attitude.

ETL Tools:

Informatica, SSIS, SSRS

OLAP Tools:

SAP Business Objects, MicroStrategy, IBM Cognos

Data Monitoring:

Tableau reader, Tableau server, Data Blending, Storyline, Flask

Languages:

T-SQL, SQL, Python, SAS

Databases:

Oracle, SQL Server, MySQL, Teradata

Tools:

NumPy, matplotlib, pandas, TOAD

Methodologies:

SDLC, Waterfall, Agile, RUP, Iterative

Operating Systems

Windows, Linux, UNIX Sun Solaris

Master of Science in Biomedical Engineering

Binghamton University, Binghamton, NY

UnitedHealth Group, TX Jan 2019 - Current

Data Analytic / Data Analyst

Roles & Responsibilities

Performing Data Pre-processing using Python/SAS based on the nature of the source system.

Performing statistical analysis, data mining and retrieval processes on a large amount of data to identify trends related to Defaulting prediction model, figures and other relevant information using SAS and python.

Analysed Tableau reports utilizing Data Blending, multi-level hierarchy, Table Calculations, Parameters, Graphs and Actions.

Designed, developed and implemented Power BI Dashboards, Scorecards & KPI Reports.

Project Lead accountable for providing strategic and tactical leadership to ensure full project life cycle design, development and implementation of database solutions.

Involved in developing Unix Shell wrappers to run various SQL Scripts.

Extensively created UNIX shell scripts for scheduling and running the required jobs.

Created shell scripts and used Unix commands to manipulate input files for test execution.

Performed due-diligence with the existing data in data warehouse, Generated ad-hoc SQL queries using joins, database connections and transformation rules to compare and evaluate the prediction models.

Used Informatica Designer, Workflow Manager and Repository Manager to create source and target definition, design mappings, create repositories and establish users, groups and their privileges.

Extracted data from the databases (Oracle and SQL Server, FLAT FILES) using Informatica to load it into a single data warehouse repository.

Experience in building models with deep learning frameworks like TensorFlow, PyTorch, and Keras.

Extensively worked with Python 3.6 (NumPy, Pandas, Matplotlib, NLTK, spaCy, and Scikit - learn)

Experienced in Python data manipulation for loading and extraction as well as with python libraries such as matplotlib, NumPy, SciPy and Pandas for data analysis.

Scraped and collected data by using python package processed and cleaned raw data from a wide variety of financial websites, transformed and converted unstructured data set into structured data products using multiple functions in pandas.

Automated the process of web login and extraction of financial data using selenium web driver and associated libraries in Jupyter notebook as well as PyCharm.

Walgreens, NY May 2018 – Dec 2018

Data Analytic

Roles & Responsibilities:

Used Agile Methodology and SCRUM Process.

Developed and maintained Python ETL scripts to scrape data from external sources and load cleansed data into a SQL Server. The data was used for daily electrical power virtual trading activities in several markets.

Implemented discretization and binning, data wrangling: cleaning, transforming, merging and reshaping data frames using Python.

Propose appropriate analytics to meet business objectives based on the study Protocol and monitoring plan.

Used ETL (SSIS) tasks to develop jobs for extracting, cleaning, transforming and loading data into data warehouse.

Experience managing large projects with a close attention to detail and high-quality performance.

Deployment and Scheduling SSIS Package through SQL Server and file system Report development in SSRS.

Project monitoring, Status reporting and client interaction.

Monitoring the critical components of the System.

Implementation and support of systems in production environment.

Supports all production analytics and can effectively troubleshoot different types of issues - data/analytics.

Contribute towards overall process optimization and metrics framework.

Lamda, India May 2015 – Jun 2017

Data Analytic

Roles & Responsibilities:

Analysis of functional and non-functional categorized data elements for data profiling and mapping from source to target data environment. Developed working documents to support findings and assign specific tasks

Successfully worked on Data visualization with tools like Excel, Quick View.

Monitored automated data pipelines from various external data sources (web pages, API etc) to internal data warehouse (SQL server), then export to reporting tools by Python.

Evaluated the results obtained from the system by simulating the system on real subjects using data visualization techniques in Excel and Matplotlib in Python.

Created Power BI visualization of Dashboards & Scorecards (KPI) for Finance Department

Designed and implemented a Monthly Resource Allocations system using Excel Power Query & Power Pivots to be included in dashboard reporting.

Assisted Supply chain analysts with automating reporting functionality using Power BI tools Power BI reporting, Dashboards & Scorecards (KPI) and MySQL, AWS & Data warehouse data sources.

Analyzed business requirements, created business/process flow charts, database flow charts and reporting tool database for Research Revenue Management tracking for Expedia.

Monitored Tableau Data Visualization reports using Pareto's, Combo charts, Heat Maps, Box and Whisker plots, Scatter plots, Geographic Map and Cross tabs.

Expert in dimensional data modelling using SSAS and Erwin, cube partitioning, optimization, creating aggregations.

Worked with delivery of Data & Analytics applications involving structured and un-structured data on Hadoop based platforms.

Performed performance improvement of the existing Data warehouse applications to increase efficiency of the existing system.

Performed data cleaning and imputation of missing values using Python and used Hive to store the data and perform data cleaning steps for huge datasets.

Kaustubh More

Data Analyst

Location: TX

Mobile:210-***-****

Email: adafcc@r.postjobfree.com

Summary

Skills

Education

Experience



Contact this candidate