Surabhi Kator
Harrison, NJ, ***** admap1@r.postjobfree.com +1-973-***-**** LinkedIn Github Tableau EDUCATION
Master’s in Information Technology and Analytics January 2021 Rutgers University, Rutgers Business School, Newark, NJ GPA: 3.56/ 4.0 Coursework: Intro to Data Science, Business Analytics Programming, Project Management, Statistics & Machine Learning, Data Analysis & Visualization, Business Forecasting, Business Data Management, Analytics for Business Intelligence Bachelor of Engineering in Computer Engineering June 2018 University of Mumbai, India GPA: 8.14/ 10.0
Coursework: Relational Database Management Systems, Data Warehouse & Mining, AI, Cloud Computing TECHNICAL SKILLS
Programming: Python, SQL (Joins, Window Functions), R programming, NoSQL, C, C++, Java, HTML, CSS, JavaScript Tools: Advanced Microsoft Excel (VLOOKUP, Pivots, VB), Tableau, Power BI, R Studio, Alteryx, MS Access, AWS Skills: Predictive Analytics, Data Wrangling, A/B Testing, Data Visualization, Time Series Forecasting, Data Modeling PROFESSIONAL EXPERIENCE
Research Assistant, Rutgers Business School, New Jersey February 2021 - Present
• Conducted data wrangling on 100k+ rows of raw data to spot missing values, identify outliers, clean the data, and format
• Used decision tree classification model on the cleaned data to predict the outcome of the model using Alteryx Data Analyst Intern, Changing the Present, New York, New York September 2020 – December 2020
• Worked with the web analytics team to perform data cleaning and feature extraction of datasets using Python
• Identified, analyzed, & interpreted data trends in complex data sets using statistical techniques to provide deeper insights
• Showcased 5+ business analytics dashboards to explore market patterns in the data with Tableau Part-time Lecturer, Rutgers University, New Brunswick, NJ January 2020 – December 2020
• Developed and delivered engaging lectures on SQL, Advanced Microsoft Excel, Python to 100+ undergraduate students
• Strategically planned, implemented, and tracked lesson objectives and assessments of the students
• Maintained excellent communication with students & aimed at clearing the basics to make them understand the concepts Business Data Analyst, Internship, Signal Alliance Pvt Ltd, India January 2019 – July 2019
• Designed dashboards in Tableau and Microsoft Excel to analyze the monthly sales of the organization
• Automated data validation process in Excel using Macros and VBA and increased the overall processing speed by 20%
• Implemented Data Modeling using Apache Cassandra and build an ETL pipeline using Python and PostgreSQL
• Managed a team of 5 and developed, maintained timelines, schedules for the team & identified tasks on critical paths Database Analyst, Internship, DKNS Computer Systems, India June 2018 – December 2018
• Monitored and processed 10 years of company data in the database using SQL scripts in MySQL Workbench
• Created, scheduled, and maintained the company database by writing complex SQL queries and window functions
• Performed query optimization and structured the database to improve the overall performance of the system by 25 % PROJECT EXPERIENCE
Market Basket Analysis (Master’s Capstone Project) Python, Jupyter Notebook, Data Analysis
• Analyzed a dataset of over 3 million grocery orders from 200,000 Instacart users to analyze consumer trends & patterns
• Generated association rules between 200+ products at Instacart using Apriori association rule mining algorithm
• Implemented a recommendation system using collaborative filtering, which would predict the items bought together Netflix vs. Disney Plus Data Visualization Tableau, Data Visualization
• Created 2 dashboards in Tableau for Netflix and Disney Plus to analyze and compare the attributes associated with them
• Generated key insights from the visualizations and analyzed the aspects for both streaming services, and compared them Movie Recommendation System Python, Jupyter Notebook, Exploratory Data Analysis
• Created a movie recommendation engine in Python on a dataset that contains TMDB ratings of 45,000 films
• Used Python libraries like NumPy and Pandas and algorithms like regression, classification to predict movie revenues and got an accuracy of 80% for the data using the Gradient boosting Classifier algorithm Tesla Stock Price Prediction R, R Studio, Time Series Forecasting
• Scraped data from the yahoo finance website to collect the Tesla stocks for the last 6 years from 2015 to 2021
• Conducted time series forecasting on the data in R studio using ARIMA model to predict the future stock prices and fit the model with an error rate of 2.68%