KANISHKA JAIN
+1-301-***-**** ********@***.*** College Park, MD, United States Linkedin /kanishk-jain GitHub /kanishk307 Analyst with a highly analytical background and experience in using data cleaning, pre-processing, data mining, data exploration & visualization to solve business problems; hands-on experience in web development.
E D U C A T I O N
Master of Information Management (Data Analytics) GPA: 4.0 College of Information Studies, University of Maryland, College Park, United States Expected May 2021
Courses Completed: Data Analytics, Data Science, Machine Learning, Location Intelligence, Management Concepts and Principles
Bachelor of Engineering (B.E.) (Information Technology) GPA: 8.44 / 10 University of Mumbai, India Jun 2019
Courses Completed: DBMS, Data Mining and Business Intelligence, Software Project Management, Big Data Analytics, Soft Computing, OOPM, Data Structures
Skills: SQL (MySQL, SQLite, PostgreSQL), R, Python (Numpy, Scipy, pandas, Matplotlib, Seaborn, scikit-learn), Tableau, Tableau Prep, MS Excel, Power BI, QGIS, Google Suite, MS Office, Git, GitHub Data Wrangling, Data Analysis, Communication and Data Visualization, Machine Learning, Statistics
P R O F E S S I O N A L E X P E R I E N C E
Graduate Teaching Assistant University of Maryland, College Park, MD Sep 2020 – Present
Proposed and developed Programming Component Exercises, projects for undergraduate course INST126 – Introduction to Python Programming
Developed test-cases for automated grading, evaluated projects, monitored progress, and resolved concerns of over 50 students Research Analyst Intern ANGARAI, Greenbelt, MD Jun 2020 – Aug 2020
Overseen real-time application of integrated Data analysis & validation of ANGARAI Quality Management Systems.
Handled integration for the website and portals for user experience enhancement in relation to the management team.
Collected and analyzed various data sets for ANGARAI’s Quality Management System from various other functional departments.
Tested, demonstrated, compiled, and documented IT management requirements, in compliance with industry standard practice. Summer Research Assistant Department of Communication, University of Maryland, College Park, MD Jun 2020 – Aug 2020
Investigating universities’ responses to the COVID-19 pandemic while conducting a content analysis of approximately 530 higher education institutions’ COVID- 19 websites. Worked with various python libraries such as numpy, pandas, selenium, matplotlib, etc.
Built a crawler for data collection and solved challenging Data Analysis problems Graduate Research Assistant & Web Developer Institute of Systems Research, University of Maryland, College Park, MD Oct 2019 – Jun 2020 Growth Path: Web Developer (Oct 2019 – Jan 2020) Graduate Research Assistant (Jan 2020 - Jun 2020)
Managed and developed databases, websites & custom plugins based on Wordpress and Drupal.
Improved hits by around 50% by building a new Wordpress based website
Involved in strategic decision making to ensure smooth inflow and outflow of departmental funds Data Analyst Digital Curation Innovation Center, University of Maryland, College Park, MD Aug 2019 – Dec 2019
Participated in an intensive digital curation program and explored the computational treatments of archival records.
Extracted and visualized confidential WWII Japanese-American Incarceration Camp records while working alongside a multifaceted team.
P R O J E C T S
Indian Food Recipe Dataset Generation Python (Numpy, Pandas, BeautifulSoup, Googletrans), MS Excel, TableauPrep Oct 2020 – Nov 2020
Crawled, Scraped, Cleaned and Transformed a website to create a dataset of 6800+ Indian Food recipes.
Used Googletrans, a python library built on top of Google Translate API to make records consistent.
The dataset ranks consistently among top 15 in the all-time hottest datasets section of Kaggle Country wise Visualization and Statistical Information based on Covid-19 API Python (Plotly, Numpy, Pandas, Matplotlib) Mar 2020 – Jul 2020
Country wise (based on user input) information of various aspects related to COVID19. Data is taken from the COVID19 Free API. Visualizations of Total Cases, Total Recoveries, Active Cases, Total Deaths based on real-time data. Statistical information including critical numbers for the past 14 days.
Scraped data from a website (Worldometer) to develop country-wide exploratory analysis of data related to COVID-19 based on factors like Total Cases, New Cases, Total Deaths, Total Recovered, Active Cases, Serious/Critical, Total Cases/1M Population. Insights and Predictions based on H1B dataset MS Excel, SQL, Python, R Jan 2020 – May 2020
Applied Regression and Classification models (such as SVM, Naive Bayes, Neural Network, Random Forest) & clustering to predict the case outcome
(certified/denied), duration for getting back the result, threshold hourly salary and predicting presence of agent. Analysis of Global Terrorism Dataset MS Excel, Python, TableauPrep, SQL Aug 2019 – Dec 2019
Conducted Data cleaning and data visualization of Global terrorism dataset (180k+ entries) available on Kaggle to draw useful insights.
Researched on the regions/countries that are the most affected by terrorism, terrorist groups have been most active, most common attack types, intensity of attacks by region for a particular year, weapons used in these attacks and most common target types.
P A P E R P U B L I C A T I O N S
‘Reframing Digital Curation Practices through a Computational Thinking Framework’ in the proceedings of the 4th IEEE Big Data 2019 workshop on Computational Archival Science.
‘Real Time inspection of Concrete Structures Using Computer Vision and Deep Learning’ published in South Asian Research Centre (SARC) and International Journal of Advances in Mechanical and Civil Engineering (IJAMCE). O R G A N I Z A T I O N S A N D A F F I L I A T I O N S
CSI-RAIT - Joint Secretary Jun 2018 – Jun 2019: Managed techtalks, hackathons and various other technical events
CSI-RAIT – Editor-in-Chief July 2016 – Jun 2018: Worked with a team of 15 writers to generate content for magazines and websites.
A W A R D S
Recipient of the 2019 DCIC Computational Archival Science (CAS) Award in recognition of contributions of the Japanese-American WWII Camp Project
& “Resistance at Tule Lake” Public Forum
Winner of Makeathon 2020, a hackathon organized by UXTerps, University of Maryland for the problem statement sponsored by Proctor and Gamble.
Recipient of the Laurence B. Helprin Award from College of Information Studies, University of Maryland for the 2019-20 academic year for the paper
“Reframing Digital Curation Practices through a Computational Thinking Framework”
Certified Professional for Requirements Engineering (CPRE) by International Requirements Engineering Board (IREB)