Post Job Free
Sign in

Data Scientist, Python, R, SQL, Statistics, Machine Learning

Location:
San Mateo, CA, 94403
Posted:
August 12, 2020

Contact this candidate

Resume:

Gowtham Teja Kanneganti

Highly motivated professional with experience delivering data-driven analytical solutions for business problems. • *********@*****.*** • +1-405-***-****

• linkedin.com/in/gowtham-kanneganti/

• gt0410.github.io/ • github.com/gt0410

Skills

• Software/Tools: Python, R, SQL, Tableau, Power BI, Java, C++, Spark, Hadoop, Git, Linux

• ML Techniques: Deep Learning, Statistics, TensorFlow, Keras, Computer Vision, NLP, AWS Sagemaker, Azure ML

• Certifications: Tableau Analyst, Deep Learning Specialization, Big Data Hadoop and Spark, Power BI Training

Education

Master of Science in Data Science and Analytics, University of Oklahoma GPA 4.0 Aug 2018 - May 2020

Bachelor Engineering in Mechanical Engineering, Birla Institute of Technology, India GPA 3.5 Aug 2013 - May 2017

Work Experience

University of Oklahoma, Norman, OK, US

Graduate Research Assistant 01/2020 - Present

Training SARIMA, LSTM, CNN-LSTM models on time-series data to predict travel time on roads that helps to make decisions on road construction. Research on machine learning techniques that helps the department of transportation.

University of Oklahoma, Norman, OK, US

Graduate Teaching Assistant 01/2019 - 12/2019

Responsible for designing and grading assignments, organize labs, and hold office hours for the following courses:

• Intelligent Data Analytics • Python Programming – Computer Science

National Weather Center, Norman, OK, US

Research Assistant 09/2018 - 01/2019

Developed software to visualize and predict the aircraft trajectory. Mapbox and HTML are used to visualize air traffic. Implemented text mining techniques to scrape datasets, and this software help to operate UAVs in a safe zone.

IFB Industries Ltd, Bengaluru, India

Graduate Engineer 07/2017 - 06/2018

Increased the tool life by 20% using statistical hypothesis testing. Presentation of monthly manufacturing reports using data from the SQL database to communicate technical information. Forecasting product sales.

Projects

Thesis: Detection of Overshooting Cloud Tops (OTs) with Convolutional Neural Networks, Python, TensorFlow Report

Combined satellite images having different resolutions using interpolation algorithms. Trained data using Imbalanced machine learning techniques as the occurrence of OTs is a rare event. Developed a CNN model that detects OTs with a recall of ~92%.

Sentiment Analysis on IMDB movie reviews, Python, AWS, Keras, Natural Language Processing

Trained model on 50K movie reviews that classifies the sentiment of a review with ~93% accuracy. Worked on word embeddings and compared the performance of LSTM and CNN-LSTM architectures.

TMDB Box Office Prediction, R, Text Mining, Predictive Modelling Kaggle Git

Predicted box office revenue of movies with an RMSE of #0.2069. Performed EDA, Data Visualization, Feature Engineering. Gradient Boosting Machine (GBM) performed best. Comparison of performance of different regression algorithms.

DS for Good: City of LA, Python, Text Analytics Kaggle Git

Converted plain-text job postings into a CSV file to improve the quality and diversity of applicant pool. Extracted data using Regex, Text Mining, Visualization, lemmatization, TFidf Vectorizer, and word2vec is used for employment recommendations.

Redactor/Unredactor Text Analytics, Python, Text Analytics, Machine Learning, Classification Git

Redacted sensitive information in files using Regex and nltk package. Developed a Random Forest Classifier to identify redacted words. Performed text normalization, text vectorization, named entity recognition, POS tagging, and Tf-Idf to extract features.



Contact this candidate