San Jose, California, United States
March 25, 2019

Data enthusiast experienced in working with real-world data sets, wrangling the messiest of data, and creating thoughtful visualizations. Master's graduate specializing in Data Analytics & Data Science.

EDUCATION

Master of Science in Information Systems, Santa Clara University, GPA - 3.5/4.0, Jan 2017 - Dec 2018

Bachelor’s in Computer Science, Sikkim Manipal University, GPA - 3.3/4.0, Jul 2011 - Jul 2015

TECHNICAL SKILLS

Scripting Languages: Advanced: Python; Intermediate: R; Basic: SAS

Programming Languages: Advanced: SQL; Intermediate: C++, C#; Basic: HTML, PHP

Databases: MS Access, SQL Server, MySQL, MongoDB, Redshift

Data Visualization: Tableau, Excel, Google Sheets, Power BI

Distributed Systems: Hadoop/MapReduce, AWS, Pig, Hive, Apache Spark

Office Software: Advanced Excel (solver, decision tree, macros, pivot tables, VLOOKUP)

Machine Learning: Regression, Naïve Bayes, k-means, Random Forest, SVM, Deep Learning

Tools: Jupyter Notebook, RStudio, Microsoft Office Suite, GitHub, Google Analytics, Kibana, JIRA, ETL (Pentaho)


Data Analyst Intern, Automation Anywhere, San Jose, CA, June 2018 - Aug 2018

Technical Skills: Jenkins, Groovy, Python, Logstash, Elasticsearch, Kibana, C#

• Tuned data-processing performance by developing Jenkins pipeline scripts with indexing logic for Elasticsearch, processing 5,000 metrics per second

• Analysed Jenkins build logs to identify bottlenecks and worked on eliminating failures during pipeline execution. Achieved 100% error-free execution.

• Developed real-time dashboards to evaluate the performance of builds for strategic data analysis

• Automated daily tasks on AWS and Google Cloud by creating bots in C#, increasing productivity by 40%

Systems Analyst, Tata Consultancy Services, Kolkata, India, Aug 2015 - Jan 2016

Technical Skills: Layer 7, Log Analysis, MS Excel, Python, MS Access, MySQL, Tableau

• Conducted exploratory data analysis to investigate the health of servers to ensure high uptime & fast response time

• Performed log analysis using pattern detection and recognition, normalization, tagging and classification to reduce network issues on the production servers by 30%

• Extracted data from BMC Remedy and used MS Excel and Python features to identify KPIs like SLA, issue level, resolution time, reason, and resources utilized

• Delivered interactive Tableau dashboards to interpret the key SLA metrics and offered actionable insights into ticket resolution times to support effective managerial decisions

ACADEMIC PROJECTS

Data Warehouse – Indian Premier League Insights (SQL, SSIS, Tableau)

• Built an ETL pipeline using SSIS to extract data from multiple sources, created mappings and transformations based on the objectives, and stored the results in a SQL database

• Ensured data quality and integrity by using custom data validation and stored procedures for metadata summarization for the data warehouse tables

• Created Tableau dashboards for team owners by aggregating player statistics and identifying playmakers and losers, in order to decide which players to retain for the next season

Database Project - Community Management System (SQL, Stored procedures, Indexes and Triggers):

• Designed a normalized relational database schema for a residential community to enhance the living experience of each household and identified key metrics like vacant apartments, prospective-resident conversion rate, and maintenance resolution rate

• Optimized query performance by using Stored Procedures, Functions, Indexes and Triggers. Defined views in SQL to consolidate data for reporting

Machine Learning based Job Application website (MySQL, R, AWS, HTML, CSS)

• Built a website based on self-evaluation tests to connect job seekers with recruiters according to their skills and interests

• Created a data pipeline to extract test results gathered from users and pass the data to a logistic regression module that predicts the likelihood of a recruiter reaching out to the job seeker

• Calculated scores based on a scoring matrix designed for the test results, and analysed the results using R Shiny to give candidates with low scores detailed feedback and recommendations for improving their skills

Data Analysis & Forecasting on Sales Data (MS Excel, Python, Machine Learning, Tableau)

• Built Tableau dashboards to report trends in the sales data and identified key metrics such as game revenue, server cost, and POS revenue to evaluate business performance

• Used statistical techniques for cleaning, merging, and modelling a 1-million-row dataset and performed hypothesis testing to evaluate customer reactions to new games on server-assistant devices in the restaurant

• Developed time series models (AR, MA, ARIMA) to forecast monthly, quarterly, and yearly sales figures and recommended ways to increase revenue

Data Analysis on Data Breaches: (Python – pandas, NumPy, seaborn, matplotlib, Yahoo Finance, Tableau)

• Wrangled Data Breaches dataset to study the impact of an organization on the stock market based on the and the customer behaviour post security breach

• Scraped and collected stock data for the breached firm using requests and Beautiful Soup, starting from the day of the breach, to evaluate market reaction

• Presented key insights in a persuasive, exploratory storyboard using metrics like industry, data sensitivity, records lost, and average stock price

Exploratory Data Analysis and Prediction System: (Python – classification & regression models)

• Implemented logistic regression to analyse the Young People Survey dataset and predict lonely and non-lonely individuals based on a given set of behaviours

• Performed descriptive analysis using decision trees to understand the factors leading to alcoholism; the insights confirmed that underage drinking is quite common even though it is illegal

ADDITIONAL INFORMATION

Teaching Assistant for Business Intelligence & Data Warehousing Course Feb 2018 – June 2018

• Formulated and graded assignments and projects for graduate students. Guided students in lab sessions and collaborated on discussion forums

Teaching Assistant for The Business of Cloud Computing Course Feb 2018 – Aug 2018

• Held office hours to answer graduate students’ questions on AWS EC2, S3, Redshift, and RDS. Created and graded projects.

Certified in Google Analytics

• Designed a personal portfolio website and optimized it for search engines through relevant keyword placement and improved title tags, resulting in improved website ranking

• Utilized Google Analytics to track visitor flow and interaction throughout the website

