Post Job Free

Resume

Sign in

Data Software Developer

Location:
Brooklyn, NY
Posted:
June 22, 2018

Contact this candidate

Resume:

Gaurang Vilas Gatlewar

Address: **, ****** *, ********, ** 11223 Email: ac5yzz@r.postjobfree.com Tel: +1-347-***-**** LinkedIn: https://in.linkedin.com/in/gaurang-gatlewar Github: https://github.com/GaurangGatlewar HackerRank: https://www.hackerrank.com/Gaurang_Gatlewar Education: Computer Science

New York University (2018)

Masters of Science[3.5/4]

Indian Institute of Technology (2016)

Bachelor of Technology[3.5/4]

Technical Skills:

Languages: C, C++, Java, Python

Web Technology: HTML, MySQL, PHP,

CSS, JavaScript, Flask.

Tools: MS Office, Amazon AWS,

MATLAB, AutoCAD, CATIA, Wireshark,

Jupyter Notebook, VirtualBox, GitHub,

Metasploit, Hadoop, MapReduce, Pig,

Spark, Docker, Tensorflow.

OS: Windows, Linux, Mac.

Coursework:

New York University:

Machine Learning

Deep Learning

Introduction to Data Science

Programming for Big Data Analytics

Design and Analysis of Algorithms

Data Structures and Algorithms

Penetration Testing

Network Security

Computer Networking

Introduction to Operating Systems

Computer Architecture

Coursera:

Python Data Structures

Using Python to access Web Data

Using Databases with Python

Cryptography 1

Algorithms, Part 1

Algorithms, Part 2

Neural Network and Deep Learning

Machine Learning with Big Data

Extracurricular Activities:

2400th in India in IIT-JEE (99

percentile).

Ranked 606th (96.48 percentile) in

IMO (International Mathematics

Olympiad).

Event organizer, NYU Center for

Cybersecurity.

Selected by Goldman Sachs as a

Quantitative Analyst.

Departmental Representative in

Department of Undergraduate

Committee.

Ranked 214/2453(Gold) in

NCL CTF 2018.

Coordinated with Kiran Society (NGO)

and improved the KAFO (leg caliper)

design.

EXPERIENCE

Revmax Corp, New York, USA

Software Developer Intern May 2017-July 2017

● Scraped 7+ websites using BeautifulSoup library and extracted json data from 15+ rest APIs to gather 25+ GBs of data spanning 30 years.

● Performed ETL by setting up a cron job using Amazon AWS Lambda function using a python script along with its dependencies in a virtual environment resulting in an automated data collection process.

● Co-ordinated with a team to setup a ML model to predict the probability of hiring a cab and achieved an 87% cross-validated accuracy in 30% of the area which was determined by setting up an 80% confidence level threshold. Larsen & Toubro Infotech, India

Research Summer Intern May 2013-July 2013

● Coded in MATLAB and C/C++ to design Whitcomb Rule compliant UAV fuselage reducing drag thus leading to supersonic flight capability.

● Modified design to include delta wings with NACA airfoil providing better gliding capacity increasing the flight duration by 30%.

● Simulated the flight in MATLAB flight simulator and optimized parameters to maximize the efficiency.

ACADEMIC PROJECTS

Taxi Demand Prediction:

Technologies Used: Python, PySpark, MongoDB, Web Scraping, Hadoop, R, Django, Rest APIs, Data Analytics, Machine Learning, MapReduce, Pig, Cluster Analysis.

● Performed ETL to gather data(>10GB) from 3 data sources.

● Cleaned, modified and merged the data into a single dataset for prediction model also making it scalable for Big Data.

● Predicted demand using Random Forest Classifier achieving a 90% accuracy.

● Optimized the model using PCA and GridSearchCV increasing the accuracy to 93% cross-validated accuracy.

Twitter Data Analysis:

Technologies Used: Python, Twitter API, Machine Learning, Jupyter Notebooks, Numpy, Pandas.

● Identified twitter “Bot” profiles using Machine Learning Classification algorithms based on the attributes like name, location, verified, etc.

● Extracted tweets using Twitter Rest API and processed them using Python libraries such as pandas and numpy in Jupyter Notebook.

● Tested multiple algorithms and determined the best suited algorithm.

● Optimized the performance using GridSearchCV to predetermine the optimal input parameters which increased the probability by almost 2%.

● Ranked 1st in Kaggle competition by achieving >95% accuracy. Artificial Neural Network based Galling frequency estimation: Technologies Used: Matlab Neural Network Toolbox, C/C++.

● Gathered, Validated and Tested data against 12 algorithms for best estimation and chose Levenberg-Marquardt and Bayesian Regularization algorithms based on the minimum error criteria using MATLAB Neural Network Toolbox in C programming language.

● Tested multiple Neural Networks and finalized a 3 layered neural network and validated the results with available data to generate a Neural Network capable of >90% accuracy.

Google Landmark Recognition Challenge:

Technologies Used: Python, Tensorflow, NYU HPC.

● Created a dynamic batch processing python code to manage 400 GB data.

● Created a deep learning CNN to classify landmarks in 15k categories.



Contact this candidate