Objective: Seeking entry level/junior Data Scientist position in the field of data science, using my data science skills such as: analytical, statistical methods, manipulating, mining large datasets, reporting, and documenting over all 2+ years of experience.
Education:
Masters in Computer Science, Rowan University, Glassboro, NJ. May 2017 Relevant Coursework: Introduction to Statistical Data analysis, Data Warehousing, Data Mining, Data Quality and Web/Text Mining, Machine Learning, Concepts in Artificial Intelligence.
Post Graduate Diploma in Computer Applications, CMC Academy, Hyderabad, India. May 2015
Bachelor’s in Electronics and Communications Engineering, JNTU, Hyderabad, India. May 2014
Skills:
Programing Languages: Python (NumPy, Pandas, Scipy), R, Tableau, Java, SQL, PL-SQL, PHP, HTML5, XML.
Tools: Eclipse, Pycharm, Git, PowerBI, R Studio, Anaconda, Excel (macros, pivot tables)
Database: Oracle 11g/12c, MySQL, MS SQL Server 2010.
Expertise: Machine Learning, Exploratory Data Analysis, Text Mining and Predictive Analytics, Time series analysis, hypothesis testing.
Work Experience:
Java Developer Intern: Iomega Technologies July,2017-Present
Attended trainings on Java, Advanced Java, Spring, Hibernate Frameworks, Web Services.
Involved in planning and estimation of project Artifacts.
Created new tables, sequence and written SQL queries in Oracle.
Developed User Interface using HTML, JSP, Java Script and JQuery.
Developed DAO’s for retrieving the data from the database.
Implemented MVC design pattern using Spring framework.
Academic/ Research Projects
Data Analyst: Predictive Analysis on Diabetic Patient Feb-May 2017
The objective of the project is to predict the time-period of a diabetic patient under in a hospital
Performed data cleaning, built a predictive model using predictive methods (Random Forest, Support Vector Machine and Neural Network) using R-Studio.
RF had shortest runtime, SVM had comparatively higher runtime, NN was slowest with the least error rate of 14.5%, thereby being better than the other two.
Data Analyst: Research Project Nov–Feb 2017
Served as a Data Analyst for building a prediction model to forecast currency exchange rate by analyzing time series data.
Identified key factors influencing currency trend, performed data collection, data preprocessing, ETL, exploratory data analysis, classification, time series analysis, K fold Cross validation to identify the future trends of currency.
Used regression analysis, Decision tree, Random Forest, Ensemble methods, time series ARIMA and conducted performance metric analysis to find the best model to forecast currency trend
Data Analyst: Sentiment Analysis (Web Text Mining) on Smart Phones August-Dec 2016
The objective of the project is to analyze the user sentiments based on tweets on the high-end smart phones such as iPhone 6/7, google pixel, OnePlus products.
Analyzed 68780 tweets harvested over three months. Implemented web crawler using twitter API. Used Naïve Bayes, Voter algorithm to find the polarity & emotions of the tweets.
Implemented word cloud to visualize term document frequency(TDF) of tweets.
Analysis on University Data for Admissions Feb-May 2016
The objective of the project is to collect, analyze, University dataset. Used Association rule mining, Parallel Co-ordinate Plots and Principal Component Analysis to analyze the data set.
The colleges were well classified based on their academic environment, quality of life and other important factors such as cost of education, student population etc., which would matter to the students.
Connect Four-Artificial Intelligence August-Dec 2015
The objective of the project is to implement the Artificial Intelligence concepts using min-max algorithm.
Connect Four is a tic-tac-toe like game in which two players plays by dropping the discs in to a 7x6 board.
Instead of Brute force search, using knowledge based approach trains the machine.
Automatic Database Schema Generation March-June 2015
Served as developer to build Graphical user interface for performing Database CRUD operations.
This application uses Oracle 10g Database.
Application was developed in Java.
Certifications:
Introduction to R -Data Camp
Intermediate R -Data Camp
Importing Data in R -Data Camp
Data Cleaning in R -Data Camp
Introduction to SQL for Data Science -Data Camp
A-Z Machine Learning -Udemy (In Progress)
Data Visualization and Communication with Tableau -Coursera (In Progress)
Languages: English, Hindi, Telugu, and Urdu.
Additional Information:
Eligibility: Eligible to work for any employer in USA on OPT-EAD.