Post Job Free

Resume

Sign in

Data Assistant

Location:
DeKalb, IL
Posted:
March 03, 2021

Contact this candidate

Resume:

Prasad Patil

*** ******* **, ******, **

adkmxf@r.postjobfree.com +1-815-***-****

EDUCATION

Northern Illinois University, DeKalb, IL Dec 2020

Master of Science in Management of Information Systems (Specialization in Business Analytics)

University of Mumbai, India Jan 2018

Bachelor of Engineering in Computer Engineering

TECHNICAL SKILLS & CERTIFICATIONS

Reporting Tools: Tableau, Microsoft Excel (Pivot tables, vlookup), Power BI

Database: MS SQL Server, MySQL, MS Access, MongoDB

Programming Proficiency: Python (Pandas, Scikit learn, NumPy, Matplotlib), R (dyplyr, Tidyverse, Sparklyr, ggplot2), Java

BI Tools: SAP NetWeaver Business Warehouse, SAP Business Object Analysis, and Query Designer

Statistical Techniques: Simple and Multiple linear Regression, Hypothesis Testing, Correlation, ANOVA, Moderation Analysis, Chi Square, Cluster and Factor Analysis

Machine Learning: Supervised and Unsupervised machine learning models, Naive Bayes, SVM, Decision trees

Other Tools & Techniques: MS Office Suite, Jupyter Notebook, UML/Use Cases, SAP ERP, SAS, Agile/Scrum development, ETL

Certifications: 1) Machine Learning Projects with Google Cloud Platform (Coursera), 2020 – Big Query, Dialog flow, AutoML, Vision, Data Governance

2) AT&T Summer Learning Academy Extern, 2020 – Entry Level training in human resource finance, advertising, media and technology

3) GE Digital Technology Data Analytics Program, 2020 – Table unions and joins using visual prepare recipe and SQL recipe on the Dataiku platform, building a KPI table, creating insights and Dashboard visualizations on Dataiku platform

4) KPMG Data Analytics Consulting Virtual Internship, 2021 – Data quality assessment, Data Insights and presentation

5) Post-Crisis Leadership Certificate, University of South Florida, 2020 - recruiting, organizing, evaluating, and leading a resilient team, leveraging key data to model, analyzing, and visualizing multiple possible scenarios

Publications: Hierarchical Attribute-Set- Based Encryption and Enhanced Access control in Cloud Computing. – International Conference on “Emerging Trends on Computing and Communication” (ICETCC – 2017)

WORK EXPERIENCE

Graduate Assistant, Northern Illinois University, DeKalb, IL Aug 2019 – May 2020

Created statistical reports and visualization of 200+ students enrollment data for the OMIS Department of the University.

Assisted the Department chair with research activities.

Assisted a class of 30+ students in understanding linear programming, forecasting, inventory models, decision theory, simulations and statistical models.

Corrected 30+ assignments & H/W each week, proctored tests, and provided grading assistance according to the university standards.

ACADEMIC PROJECTS

Analysis of Divvy Bike share data and influence of external factors Jan2020 – May 2020

Performed Data wrangling and inspection on the Divvy Bikes dataset of 1million records from the famous Bike sharing company based out of Chicago. Activities involved Data standardization and union of separate dataset file.

Performed basic aggregate functions using R programming such as average age of the rider, rider gender distribution, count of bikes in operation, minimum and maximum duration for which bikes are operated to plan maintenance and service schedule.

Visualized the insights in R studios using graphs such as scatter plots, bar charts and pie diagrams.

Built a Multiple regression model to predict average bookings per day given the average temperature, snow depth and rain for that day. Significant p value was achieved.

Built a Logistic Regression model to predict if user type group is Subscriber or Customer based upon average trip duration and number of bookings on a particular day. Significant ROC curve and AUC values were achieved.

Clustered Station id’s based upon booking count and trip duration using K-means clustering algorithm to prioritize which stations should have more bikes.

Data Analysis on Smartphone Data using Python (Amazon.com) Aug 2019 – Dec 2019

Performed Sentiment Analysis using Vader sentiment analysis tool on a data set of around 4.5k records of reviews of 200 different unlocked smartphones purchased on amazon.com about their experience using a particular brand phone, calculated and visualized polarity scores alongside other attributes to understand customer satisfaction.

Developed a Topic model using LDA to identify and analyze 30 dominating keywords from different topics across the whole corpus and created tables, graphs, and word cloud using Tableau to communicate the results and delivered a final presentation.

Relay Bike Share System using MS SQL Server Jan 2019 – May 2019

Created a snowflake schema-based SQL database for the Relay Bike Share Company data with attributes like Customer, Station, Bikes ID, Accessories, Employee, Maintenance, etc. Created an Entity Relationship model for the database.

Performed SQL Commands of DDL, DML, DQL to design and execute complex queries on the database.

Hierarchical attribute-set-based Encryption and Enhanced access control in cloud computing Apr 2017- Sep 2017

Developed a cryptosystem called ABE (Attribute based Encryption) to enable secured file storage with hierarchical access control structure on drivehq cloud platform.

Used technologies such as Java, Netbeans IDE and VMware.



Contact this candidate