Post Job Free
Sign in

Data Science, Data Analysis

Location:
Fremont, CA
Salary:
80000
Posted:
January 26, 2021

Contact this candidate

Resume:

Suryateja Gudiguntla

***** ********* *******, *** ***, Fremont CA 94536 **********@*****.*** +1-857-***-**** EDUCATION

Boston University, Boston, MA September 2018 - May 2020 Master of Science in Computer Science with CGPA of 3.51/4 Graduated with a concentration in Data Analytics, additional focus on subjects of Cloud Computing, Big Data and AI. Relevant Courses: Cloud Computing, Big Data Analytics, A.I., Data Science, Data Mining, Data Analysis, Web Mining Vidyalankar Institute of Technology, University of Mumbai, India August 2014 - June 2018 Bachelor of Engineering in Computer Engineering with CGPA of 7.51/10 Relevant Courses: Machine Learning, Artificial Intelligence, Data Structures, Analysis of Algorithms, Soft Computing, Object Oriented Programming Approach (Java), Software Engineering, DBMS, Theoretical Computer Science SOFTWARE SKILLS

• Programming Languages: Python, Java, R

• Libraries: sklearn, pandas, numpy, pytorch, torchvision, cherrypy

• Tools: AWS, Kubernetes, Apache Spark, Solr, Postman, Hadoop, Tableau, Google Analytics, WEKA

• Web Technologies: HTML, Py-Flask, CSS, Bootstrap, JavaScript

• Augmented Reality using Vuforia and Unity

TECHNICAL PROJECTS

Exploring AWS Spot Instances within Kubernetes Clusters using GoLang Provide a cost-effective way (while maintaining SLA) of running a Kubernetes cluster using AWS EC2 Spot Instances which are more economical than the default Kubernetes On-Demand instances.

• Designing a control logic using GoLang that can manage termination of nodes and can add new nodes as required

• This open source project is mentored by professionals from RedHat and presented at RedHat Summit 2019. Finding Protest Activity and Estimating Perceived Violence in Social Media Data Developed a visual model which can recognize protesters, describe their activities by visual attributes and estimate the level of perceived violence in an image.

• Used Convolutional Neural Networks (CNN) resnet50 model to recognize all the required classes from each image based on the visual attributes.

Phrase Sense Disambiguation

Phrase sense disambiguation proposed a method to disambiguate sentences by recognizing the meaning of the phrases in the sentence and, hence understanding the context.

• Data Curation and Phrasebase Generation performed using a multi-node Hadoop cluster setup configured in Java

• Ambiguity resolved using rank and count of phrases and using Tagme annotator to connect to the Wikipedia links

• Entity disambiguation is performed by recognizing the context, better results than existing probabilistic systems Home Credit Default Risk

Identify if loan applicants are capable of repaying their loans based on the data that was collected from each applicant.

• Incorporated PySpark machine learning pipelines using Apache Spark and applied the stages created using PySpark ML feature methods to create the features

• Used Logistic Regression and Gradient Boosting Trees classifiers, and compared on Area under ROC metric

• Implemented hyper-parameter tuning using grid search with cross validation to improve the performance of GBT EXPERIENCE

DataWeave, Newark, CA August 2020 – Current

Data Engineering Intern

• Created various text semantics APIs using CherryPy and tested them using Postman API testing

• Performed text-based feature extraction on product information to match products across multiple ecommerce websites using various clustering algorithms and ranking them using tf-idf weights, using Python and Solr.

• Benchmarked the various algorithms and now working towards scaling it using Apache Spark with AWS DataWeave, Bangalore, India June 2016 – July 2016

Data Curation and Assimilation Intern

• Assimilated and curated products using Product Type Classification and Product Attribute Normalization for use in further stages of the Machine Learning process

• The project developed in this duration understood this data and gave insights which helped Amazon to better understand their competition, optimize the offerings for their customers, and increase sales



Contact this candidate