Data Computer Science

Location:
San Jose, CA
Posted:
April 03, 2018

Resume:

SHRAVAN NAIK

PROFESSIONAL EXPERIENCE:

University of Illinois, Springfield Mar 17 – Dec 17 [Role: Graduate Assistant]

Analyzed survey data to derive insights into business metrics and to better understand the customer experience

Performed large-scale data analysis and data integration, and developed statistical models for classification and optimization using database queries

Conducted sentiment analysis on customer surveys and cleaned, transformed, and sampled the data to derive statistically significant results

Performed data mining (SQL, ETL, data warehouse) and integrated the results with Tableau to generate visualizations

Dena Bank, Bengaluru, India Aug 15 – Jun 16 [Role: Junior Analyst]

Worked with the project manager to identify the best approach for gathering requirements

Gathered business requirements, created the Functional Requirement Document (FRD), analyzed data and workflows, and defined the scope, financial projections, and cost/benefit analysis; progressed from problem statement to well-documented designs. Prepared user guidelines for easy access to the application

Used the iterative RUP process to conduct data analysis, find missing data fields in the application, and customize them extensively using Team Foundation Server (TFS)

Used Agile Scrum methodology to define the scope of the proposed solution

Designed and developed use cases and activity diagrams using UML and business process modelling

Used MS Word and MS Visio to document the data flow of the AS-IS and TO-BE processes

Conducted JAD sessions to get SMEs' input on how to implement the application

Performed data mapping of existing data by retrieving it through SQL queries

Analyzed business requirements from a black-box testing perspective; designed and developed test plan and test case documents

ACADEMIC CREDENTIALS:

Master of Science, Computer Science Dec 2017

University of Illinois at Springfield, IL

Bachelor of Engineering, Computer Science May 2015

Sambhram Institute of Technology, Bangalore, India

SOFTWARE EXPOSURE:

Programming

Java, R, Scala, Spark

Database

SQL Server, Oracle 10g, Hadoop, RStudio, NetBeans, Eclipse

Tools/Platforms

Tableau 10, MS Office (Word, Excel, PowerPoint, Access, Visio, Project), Agile, Waterfall

Key Skills

ERP Systems, Statistics, Data Analysis, Business Requirement Gathering & Documentation, Data Visualization, SQL

ACADEMIC PROJECTS:

Binary classification on a sample of the US population for the voting process [R, RStudio]

Imputed missing data and applied classification techniques: k-NN, Naïve Bayes, decision tree, and neural network

Applied 10-fold cross-validation to check model performance

Applied ensemble learning (bagging) for parameter tuning

Applying k-Nearest Neighbors to predict MPG [R, RStudio]

Converted the horsepower attribute from character to integer

Imputed missing values and investigated the association between mpg and other features

Applied k-NN to the training data with several values of k in order to predict mpg
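
The k-NN step can be illustrated with a minimal Python sketch (an illustration only, not the original R code). It predicts a numeric target as the mean of the k nearest training rows and tries several values of k. The function name `knn_predict` and the toy horsepower/weight/mpg values are hypothetical.

```python
import math

def knn_predict(train_X, train_y, query, k):
    """Predict a numeric target as the mean of the k nearest training targets."""
    # Rank training rows by Euclidean distance to the query point.
    ranked = sorted(
        zip(train_X, train_y),
        key=lambda pair: math.dist(pair[0], query),
    )
    nearest = [y for _, y in ranked[:k]]
    return sum(nearest) / len(nearest)

# Hypothetical toy data: (horsepower, weight in 1000 lb) -> mpg
train_X = [(130, 3.5), (165, 3.7), (150, 3.4), (95, 2.4), (88, 2.1), (70, 1.9)]
train_y = [18.0, 15.0, 16.0, 24.0, 27.0, 32.0]

# Compare predictions for several values of k.
for k in (1, 3, 5):
    print(k, knn_predict(train_X, train_y, (90, 2.2), k))
```

In this sketch the features are left unscaled for brevity; with real data, features on different scales would normally be standardized before computing distances.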

Spam filter based on a Naïve Bayes classifier [R, RStudio]

Trained a Naïve Bayes text classifier on the training dataset and evaluated its performance on the test dataset

Built word clouds for the training ham set and the training spam set to compare them

Used VCorpus to build a volatile corpus from a directory and to combine corpora

Linear Regression on a sample of the Boston dataset [R, RStudio]

Fitted a multiple regression model to predict the response variable

Fitted a smaller linear regression model that only uses predictors for which there is evidence of association

Reduced the set of predictors using correlation and scatterplot matrices

Flight Dataset Design and Implementation [SQL, MS Access]

Normalized data, developed entity relationship diagram, developed schema for every table

Implemented database in Microsoft Access and developed reports and forms.

Weighted PageRank algorithm on the ACM citation graph [HDFS, Hive, Spark, Scala, Eclipse]

Performed data extraction and cleaning

Built the citation graph and retrieved incoming reference links to calculate the in-degree distribution

Applied the weighted PageRank function and derived the top 10 PageRank scores
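
The in-degree step can be sketched in a few lines of Python (an illustration only, not the original Spark/Scala code; it does not cover the weighted PageRank computation). The function name `in_degree_distribution` and the sample (citing, cited) pairs are hypothetical.

```python
from collections import Counter

def in_degree_distribution(edges):
    """Map each in-degree value to the number of papers that have it."""
    # Count incoming references per cited paper.
    in_degree = Counter(cited for _, cited in edges)
    # Papers that cite others but are never cited have in-degree 0.
    nodes = {node for edge in edges for node in edge}
    for node in nodes:
        in_degree.setdefault(node, 0)
    # Count how many papers share each in-degree value.
    return Counter(in_degree.values())

# Hypothetical (citing, cited) pairs
edges = [("A", "B"), ("C", "B"), ("A", "C"), ("D", "B")]
print(sorted(in_degree_distribution(edges).items()))
```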

User-based recommendation system – Apache Mahout [HDFS, Hive, Spark, Scala, Eclipse, Mahout, Java]

Calculated standard error and RMSE

Implemented an ALS recommendation system in Spark and calculated ranks for different users

Average flight delay calculation on a sample of flights in the United States – MapReduce framework [Java, Eclipse, Hadoop]

Performed data extraction in the mapper

Performed the average delay calculation on the reducer side
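
The mapper/reducer split can be illustrated with a small in-memory Python sketch (an illustration only, not the original Hadoop/Java job). The record layout, a carrier code followed by a delay in minutes, is an assumption, and the names `mapper`, `reducer`, and `run` are hypothetical.

```python
from collections import defaultdict

def mapper(line):
    """Extract (carrier, delay) from one CSV record; skip malformed rows."""
    fields = line.strip().split(",")
    try:
        return fields[0], float(fields[1])
    except (IndexError, ValueError):
        return None

def reducer(key, delays):
    """Average all delays that share a key."""
    return key, sum(delays) / len(delays)

def run(lines):
    # Shuffle phase: group mapper output by key.
    grouped = defaultdict(list)
    for line in lines:
        pair = mapper(line)
        if pair is not None:
            grouped[pair[0]].append(pair[1])
    # Reduce phase: one average per key.
    return dict(reducer(key, delays) for key, delays in grouped.items())

# Hypothetical records, including two malformed rows that the mapper drops.
records = ["AA,10", "AA,20", "UA,5", "UA,NA", "bad"]
print(run(records))
```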

Twitter Social Graph Analysis – Symmetric and Asymmetric pairs in Twitter data [Java, Scala, Spark, Eclipse]

Generated symmetric pairs of users using the MapReduce framework in Hadoop

Generated asymmetric pairs of users in Apache Spark using Scala
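
The pair classification can be sketched in Python (an illustration only, not the original Hadoop or Spark code). It assumes a pair is symmetric when both users follow each other and asymmetric when the follow goes one way only; the function name `split_pairs` and the sample edges are hypothetical.

```python
def split_pairs(edges):
    """Split directed follow edges into symmetric and asymmetric pairs."""
    edge_set = set(edges)
    symmetric, asymmetric = set(), set()
    for a, b in edge_set:
        if (b, a) in edge_set:
            # Store each mutual pair once, in sorted order.
            symmetric.add(tuple(sorted((a, b))))
        else:
            asymmetric.add((a, b))
    return symmetric, asymmetric

# Hypothetical (follower, followee) edges
edges = [(1, 2), (2, 1), (1, 3), (3, 4), (4, 3), (5, 1)]
sym, asym = split_pairs(edges)
print(sorted(sym), sorted(asym))
```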


