Data Analyst

Seattle, Washington, United States
January 23, 2018

Harshavardhan Reddy Vempati (716) ***-****


Master of Science, University at Buffalo, Buffalo, United States Aug 2015-Aug 2017

Specialization- Industrial & Systems Engineering (Operations Research), GPA: 3.54/4.00

Bachelor of Technology, JNTU University, Hyderabad, India Sept 2010-Sept 2014

Specialization- Mechanical Engineering, GPA: 3.79/4.00


Programming languages used: Python, R, MATLAB, C++ (basics), Java (basics)

Tools: Minitab, Spotfire, Tableau, Excel (VBA, macros, VLOOKUP, pivot tables), Hadoop, MapReduce, CPLEX, Gurobi

Database management systems: IBM DB2, Teradata and MongoDB

Academic skills: Branch and Bound, Linear Optimization, Design and Analysis of Experiments, Programming for Analytics, Random Forest, Regression Models, Decision Trees, Support Vector Machines, Logistic Regression, Markov Chains, Time Series Modeling, ETL (basics)

Relevant Experience

Norfolk Southern Corporation, Operations Research Analyst, Atlanta Feb 2017-May 2017

•Produced reports on the extra-train problem, based on data analysis in Python and R, to improve the accuracy of customer demand forecasts.

•Built dashboards in Spotfire to visualize data and extract business insights.

•Served as QA analyst for the intermodal equipment forecasting system, testing it to verify its quality.

•Retrieved and monitored real-time intermodal data using SQL queries in DB2 and Teradata and NoSQL queries in MongoDB, ran statistical tests, and improved the application's forecasts by 23%.

University at Buffalo, Operations Research Student, Buffalo Dec 2015-Sept 2017

Explored a method that strategically places inspection stations while requiring that each hazmat truck visit an inspection station after driving a specified distance.

Collected and sorted data in R, analyzed the solution procedure in Python, and applied it to a real-world case study.

Compared this risk control method with previously studied risk management approaches such as road bans/curfews and toll setting.

Presented a poster at the 2016 INFORMS conference in Nashville.

Larsen & Toubro Limited, Data Analyst Intern, India May 2012-July 2012

Used SQL queries to extract data from the database and analyzed the quality of the results.

Performed regression analysis on the data to predict the tool material and size for a given factor of safety.

Relevant Projects

Parallel Processing of Big Data and Data Visualization

Performed data analysis on UB classroom and semester data from 1931 to 2017 using MapReduce (MR).

Ingested the results of the MR analysis into Tableau to create visualization dashboards.

Time Series Analysis on US-Canada Border Crossing Data

Extracted data from the Bureau of Transportation Statistics website, manipulated it in Microsoft Excel, and carried out extensive analysis of the training set in R.

Fitted the best model for each of three time series methods (ARIMA, SARIMA, and Holt-Winters) and evaluated each on the test set to measure its accuracy. Forecasted the number of passengers and vehicles crossing the border for the next 5 years.

Titanic: Machine Learning From Disaster – Classification

Performed exploratory data analysis, feature engineering, feature transformation, feature extraction, missing-value imputation, cross-validation, and model ensembling in Python to improve accuracy.

Applied predictive models such as logistic regression and random forest to predict survivors of the Titanic shipwreck.

