Sign in

Data Engineer

Piscataway Township, New Jersey, 08854, United States
January 25, 2018

Contact this candidate



** ***** ******, *** *********, NJ - 08901 Email:

GitHub Profile Rpubs Profile LinkedIn Mobile: 201-***-**** EDUCATION

Masters Information Technology, Rutgers Business School, New Jersey Dec 17 Coursework: Data Mining, Data Analysis and Decisions, Machine Learning, Business Forecasting GPA: 3.83/4.0 Bachelors Computer Engineering, University of Mumbai, India June 14 Coursework: Adv. Database Mgmt. System, Data Warehouse & Mining, Data Structures, Software Architecture SKILLS

Development and Statistical Programming Languages: R, Python, Java, SPSS, SAS Machine Learning Skills: Linear & Logistic regression, Decision trees/Forests, Clustering, Classification etc. Database tools & technologies: MySQL, SQL Server, Mongo DB, Oracle, MS Access Experience in Data Mining, ETL and Visualization Tools: Tableau, WEKA, Excel, Microsoft Office Python Packages & API’s: Numpy, Pandas, Scikit, BeautifulSoup, Selenium, REST API, TwitterAPI, YelpAPI Tools and frameworks: Jupyter Ipython notebook, ShinyR, RStudio, IntelliJ, Eclipse, Git, Maven, SVN WORK EXPERIENCE

Big Data Engineer Intern, Siemens Corporate Technology, New Jersey May 17 - Aug 17

• Developed Business Analytics and Monitoring dashboard for client and implemented advanced statistical methods for prediction and optimized error detection in logs and presented reports to non-technical stakeholders

• Applied Clustering techniques to structured and unstructured log transactions and grouped them and resolved errors giving simple recommendations

• Created visualizations through Tableau to showcase features in the data and identified measures impacting business

• Performed data extraction from a DarkTrace API by writing REST Client library and analyzed data for security breaches

• Examined & studied incoming bugs on JIRA’s, fixed them and leveraged the test environment to quickly test thus decreasing the run time by a margin of 90% and increase the coverage by 80% Graduate Research Assistant, Rutgers University, New Jersey Aug 17 – Dec 17

• Coalesced multiple files & executed data wrangling and transformation using Pandas library of file sizes equivalent to 6GB

• Implemented web scraping & extracted data on physician’s background using BeautifulSoup python libraries

• Constructed REST API calls & studied prescribing behavior of physicians and perceived the similarity of nature in treatment Systems Test Engineer, Infosys Limited, India June 14 - May 16

• Developed Java based Financial Services application for enrichment of data files and reporting of eligible trades

• Improved and enhanced data analytics backend systems by automating report generation for financial transactional data in a specific format to regulators at the End of Day thereby boosting the efficiency by 30%

• Led team to achieve timely deliverables by providing technical and functional expertise & being involved in all SDLC cycle ACADEMIC PROJECTS

Predicting Likelihood of Patient Visit (R, ShinyR, Machine Learning, Classification) Dec 17

• Developed shinyR app to predict odds of patient keeping their appointment & performed exploratory analysis of features

• Implemented statistical solutions & determined which patients are at risk of not making their appointment during new registration thus preventing loss of revenue & saving doctors time Twitter Ranking Experts (Python, Machine Learning) May 17

• Extracted tweets using TwitterAPI and designed application to rank different users based on their domain expertise

• Implemented supervised learning techniques like feature extraction & used SVM, Random Forest ranking algorithms Predictive Analysis of Default of Credit Card Clients (R, Tableau, Weka) Nov 16

• Cleaned, inspected and manipulated imbalanced credit card data from UCI repository using under and over sampling

• Implemented Logistic regression, KNN, RandomForest on data to obtain statistically significant results Twitter Analysis of Top Trending Topics (Python, TwitterAPI, Natural Language Processing) Oct 16

• Examined, analyzed hash tags using Twitter API between different presidential candidates & performed NLP text mining

• Performed targeted marketing (sell merchandise) campaigns using insights derived from sentiment analysis of user’s tweets LEADERSHIP SKILLS/ACTIVITIES

• JT. GENERAL SECRETARY of the Students' Council during 2012-2013

• Teaching Assistant for Business Forecasting course during Fall 2017

• Med. Math Tutor for Rutgers Nursing School during Fall and Spring 2017

Contact this candidate