HARSHAA N. BAJAJ
• *** Park Dr, Boston, MA ****5 • 312-***-****
************@*****.*** www.linkedin.com/in/harshaa-bajaj-24s93 https://github.com/HarshaaBajaj
OBJECTIVE
Improve and expand my skill set by applying it toward the growth of the organisation.
EDUCATION
Illinois Institute of Technology, Chicago, U.S.A
Master of Science, Data Science, Aug’16 - May’18 (expected)
Academic Highlights: Statistical Learning, Advanced Database Organization, Data Preparation and Analysis, Applied Statistics, Machine Learning, Software Analysis, Project-Based Problem Solving.
Aurora’s Scientific, Technological and Research Academy, Hyderabad, India
B.Tech, Computer Science, Aug’11 - May’15
Academic Highlights: DBMS, Probability & Statistics, Analyzing Algorithms, Finite Automata, Object Oriented Languages, Mathematical Foundation of CS, IR Systems, Data Warehousing and Data Mining.
EMPLOYMENT HISTORY
Schlumberger-Doll Research (Massachusetts, U.S.A)
Data Science Intern, May’17 - Aug’17
Studied the relationships between variables to implement hierarchical Bayesian networks in Netica on data gathered from digital well logs, in order to evaluate cement conditions and the bond between the cement, casing and rock formation in boreholes.
Amazon (Hyderabad, India)
Risk Analyst, Aug’15 - July’16
Inspected real-time transactions to detect fraud patterns and fed them into the system software for automatic detection. Awarded the ‘Quality Guru’ for initiating a new program in which any transaction tagged with the word ‘EGCNTF’ would be pulled out of the system for reinvestigation and follow-up.
PROJECTS
Prevalence of Diabetic Retinopathy, Feb’17 - May’17
(Collaborative project with Illinois College of Optometry on clinical data) Built KNN, multinomial naive Bayes and SVM models on the selected features, after cleaning and exploratory analysis, to classify different types of DR.
Predicting Movie Genre, Feb’17 - May’17
Used ensemble methods over classification algorithms to predict the genre of a movie from a multivariate dataset gathered by web scraping the IMDB website.
Estimation of Surging Price for Cab Aggregator Service, Jan’17
Based on attributes such as lifestyle index, trip distance and cab type, estimated the type of surge pricing by fitting random forest, naive Bayes and neural network models.
Probability of Donating Blood, Nov’16
Identified patterns in blood donation history through statistical inference to estimate the probability of blood being donated in the upcoming session, using a KNN classifier.
Prediction of Bike Rentals, Oct’16
Modelled the dataset using different algorithms and concluded that, since the response variable was a count with overdispersion, a negative binomial model was most appropriate.
TECHNICAL SKILLS
• Languages: C, C++, Java, Python, R, MATLAB.
• Web Designing: JavaScript, HTML, XML.
• Operating Systems: Windows 10 and older versions, Linux.
• DBMS Packages: SQL, SQL Plus, PL/SQL.