Vigya Shrote
551-***-**** ******@***.*** LinkedIn GitHub
New York, New York
Education
New York University, Graduate School of Arts and Science (GSAS), New York City, NY, US
Master of Science in Information Systems CGPA: 3.43/4 May 2018
Mukesh Patel School of Technology Management and Engineering, NMIMS, Mumbai, INDIA
Bachelor of Technology in Electronics CGPA: 3.42/4 May 2012
Skills
Languages - R, Python, SQL, C#
Data Analytics and Visualization Tools - Tableau, RStudio, Microsoft Power BI, Weka, Google Analytics, MS Excel (Data Analysis)
R Packages - ggplot2, plotly, DT, dplyr, tm, quanteda, tidytext, wordcloud, leaflet, geosphere, shiny
Python Packages - pandas, matplotlib, numpy, sklearn
Databases - MySQL, MongoDB, PostgreSQL
Professional Experience
Data Analyst Intern at Itemize, New York City, New York June 2017-Present
• • • • • • • • • • • Analyze Perform Identify Research Analyze Analyze Confidence Automate Train Create API testing part-test issues regional and core and the on timers using cases Score extract report trends engine Audit with on audits Postman and analysis trends the process conducting in meaning issue perform different Optical (USA, in and for using operational and UK different Character QA train country audits utility and MS for core Web/Excel for Australia) data receipts from data engine Recognition true Mobile sets large using positives and to for and data app make Excel Paper present identify (OCR)and sets and it Pivot receipts more in-training and, it ways Machine house to Tables conduct the robust to and the team audit improve Learning Hotel and core continual tool smart engine folios the (ML) scoring audits using and the to process Merchant demonstrate in-house for Service each audit results document tool (MS)
Freelancer, New York City, New York March 2017
• Worked as a Data Analyst for a clothing brand.
• Analyzed social media traffic using Google Analytics and created visualizations for the same
Software Engineering Analyst at Accenture, Mumbai, India July 2012 - May 2016
• Simulated real-time scenarios and performed Functional, Regression, End-to-End, Integration, Load, Compatibility and Exploratory test on the tool.
• Automated the testing of Project Management tool using C# and Microsoft Visual Studio Coded UI
• Was a part of an Agile environment and practiced Agile Software Development methodologies.
• Wrote SQL scripts to extract data in a simple and effortless way
Academic Projects
Text Mining and Sentiment Analysis for Kickstarter Projects
Language - R: Analyzed the descriptions of Kickstarter projects to identify commonalities among successful and unsuccessful projects using R's text mining techniques. Also performed sentiment analysis and created a word cloud for the same using tidytext and quanteda in R.
Fires in NYC and FDNY responses
Language - R: Wrangled and analyzed the NYC firehouse dataset and created visualizations to investigate serious fire incidents requiring the fire department to respond. Also created a map visualization of the response time.
The Science behind Speed Dating
Language - Python: Created a predictive model using the machine learning library scikit-learn to analyze the attributes most relevant for successful dating. Also created a descriptive model to visualize the preferences of the opposite sex.
NYC Good Neighborhood Index
Language - R: Wrangled and analyzed 7 different datasets and ranked all 42 NYC neighborhoods based on factors like safety, hygiene, road safety and other amenities. Created visualizations such as Leaflet maps, heat maps, stem graphs and tree maps to show insights from the different analyses. Also used the R Shiny package to build an app where the user can rate the factors stated above and obtain the top 5 most suited neighborhoods in NYC.
Data Visualization of Neurons using The Looking Glass (Capstone Project)
Language and Tool - C# and Unity: Teamed up with The Looking Glass and created 3D holographic visualizations for the NYU Neuroscience team. Created visualizations to help neuroscientists gain more insight into neuron firing patterns in 3D space, which in turn will help them understand and identify animal behavior.
Finding Efficiencies for DonorsChoose.org
Tool - Weka: Analyzed DonorsChoose's enormous vault of data and experimented with four classification models (J48, logistic regression, random forest and naive Bayes) to create a model to predict a project's likelihood of receiving its full funding.
AirFrance searches on MSN US
Language and Tool - R and MS Excel: Analyzed impressions and the click-through rate of ads, identified a metric to prioritize keywords and provided a solution to optimize ad dollars.
ECG Scanning Machine Data Analysis (Bachelors)
Built an ECG unit, collected ECG readings from smokers and non-smokers, analyzed the data and showed the findings through visualization.
Achievements and Leadership Activities
• Awarded the “Titan Award” for two consecutive years (2013-2014) for outstanding achievements at Accenture.
• Was a part of the logistics department for International Emmy World Television Festival (2016) held in New York City.