Business-minded data scientist with 6+ years of experience delivering data-driven solutions for enterprises and a demonstrated ability to deliver valuable insights through data analytics and advanced data-driven methods. Seeking a Data Scientist or Data Analyst position.

Education

Indiana University, Bloomington, IN May 2018

Master of Science (M.S.) in Data Science, GPA 3.475/4.0

Vellore Institute of Technology, Vellore, India May 2010

Bachelor of Technology in Electronics and Communication, GPA 8.4/10

Skills and Tools

■Data science: Classification and Regression Model Design, Predictive Modeling, Decision Analytics.

■Business: Delivery Excellence, Collaborating with Business, Building solutions iteratively to deliver value.

■Development: Blockchain Technologies, Machine Learning Algorithms, Performance and scheduling-ETL techniques

■Data: NoSQL (MongoDB), Apache Pig, MATLAB, Oracle 11g, Informatica, HDFS

■Programming: Python, R, Solidity

Academic Projects and Coursework


Foundation of Data Science, Machine Learning, Data Mining, Time Series Analysis, Statistical Learning and High Dimensional Data, Artificial Intelligence, Cryptography, Machine Learning for Signal Processing.

Image Compression (MATLAB, Singular Value Decomposition, Python)

Compressed images to minimize file size without degrading image quality to an unacceptable level.

Applied SVD at various ranks (k) to select an appropriate number of singular values to represent the original image.

Signal Data Reduction using Vector Quantization (Python, K-means, Gaussian Mixture Model (GMM), Clustering)

Applied clustering (K-means and GMM) to audio signals to reduce the data, then tested performance on a deep neural network, comparing structure approximation measures such as STOI and PTSQ.

Smart Contract Voting for Cryptocurrency Popularity (Solidity (Remix), Geth, Mist, NLP, Python)

Built a smart contract with a voting mechanism on the blockchain to gauge the popularity of new cryptocurrencies.

Processed data feeds from Bitcoin Magazine and Twitter using natural language processing to produce the results.

Large-Scale Hierarchical Text Classification (K-means, LDA)

Implemented hierarchical K-means clustering and LDA modeling for 200,000 text documents, achieving 78% accuracy.

Professional Experience

Barclays Technology Center India, Business Analyst Jan 2015 – Jul 2016

Designed and implemented middleware architecture for ABSA (banking domain) and Fraud EVision, supporting data analysis and visualization with Informatica and MS Excel on a data set of approximately 10 million records.

Built a prototype for a new analytics platform performing large-scale operations on data in MongoDB, using Talend and Clover integrated with Cloudera Hadoop v4.7.1.

Deloitte Consulting India Private Limited, Business Technical Analyst Mar 2014 – Jan 2015

Built a data pipeline to process job resumes, applying a Naive Bayes classifier in Python to extract relevant details such as job title, salary, and years of experience for easy querying.

Capgemini India Private Limited, Associate Consultant Nov 2012 – Mar 2014

Implemented data pipelines for the US/UK/IR/GR FATCA service, handling complex business logic.

Collaborated with the Barclays Enterprise Middleware team to build a warehouse architecture model design for ABSA in the phase I implementation.

Infosys Ltd, Systems Engineer Jun 2010 – Nov 2012

Delivered multiple data integration and migration projects on XML and other database systems for Capital One Financial Services and Portfolio Recovery Associates, using Informatica and Ab Initio as ETL tools.

Teaching Appointments

Associate Instructor for Undergraduate courses in SQL and PHP at Indiana University. Aug 2017 – Present

Teaching Assistant – Onramp Course in Machine Learning with Java, Kaggle Cases. Jan 2018 – Present

