Exploratory Analysis - CDSS Hackathon for Walmart Data, October 2016
Discovered complementary and cannibalistic relationships between products in the dataset, inferred from the correlation between the sales of a product and its in-stock quantity. Explored the use of such relationships in inventory management. Secured 2nd place in the hackathon.
Topological Data Analysis of Cancer Genes, October 2016
Used a topological data analysis method known as Mapper, together with data on gene functions and the pathways in which they operate, to gain biological insight from gene expression analysis.
Analysis of New York City Taxi Data, November 2016
VIJAYARAGHAVAN BALAJI
EDUCATION
Current – 12/2017
Master of Science: Data Science
Columbia University, Fu Foundation School of Engineering and Applied Science - New York, NY
Coursework: Statistics, Machine Learning, Deep Learning, Personalization Theory, Data Visualization
2016
Bachelor of Technology: Computer Science
Manipal Institute of Technology - Karnataka, India
CGPA 9.26, University Rank #8, Student Placement Coordinator
05/2017 to Current
Data Science Intern
SAP, Palo Alto, California, United States
Image Classification Using Convolutional Neural Networks
Built and trained a neural network to classify input images across 100 categories. Studied and implemented Spatial Pyramid Pooling as a pooling method to remove the constraint that all training images be the same size.
Root Cause Analysis of Alarms Triggered at a Retail Giant
Performed NLP and text analysis on the descriptions of work orders issued when an alarm was raised. Detected the most probable cause for each type of alarm.
Automatic Speech Recognition
Used Mel-Frequency Cepstral Coefficients (MFCCs) to extract features from audio signals and employed Dynamic Time Warping as a similarity measure.
01/2016 to 05/2016
Business Intelligence and Analytics Intern - Manufacturing Group
Tata Consultancy Services - Mumbai, India
Predictive Maintenance of Club Cars
Identified historical part-failure and insurance-claim patterns in Club Cars data to estimate emerging part-failure patterns using reliability life data and Weibull analysis.
Probabilistic Risk Assessment for Industrial Safety
Recorded and analyzed incident data and predicted future incident patterns. Assessed risk profiles and injury risks for each job and compared them using an injury count model, an injury risk model, and derived statistics such as characteristic life and potential number of injuries.
06/2014 to 07/2014
Digital Marketing and Consumer Behavior Intern
Tata Consultancy Services - Mumbai, India
WORK HISTORY
DATA SCIENCE PROJECTS
Python full stack (pandas, NumPy, scikit-learn, Keras)
R, SQL, C/C++, Tableau
SKILLS
114 West 109th Street, Apt 3c, New York, NY 10025
*************@*****.***, github.com/vijaybalaji30
https://www.linkedin.com/in/vijayaraghavan-balaji-294aa7b6