Mayank Goswami
Chicago, IL ***** 312-***-**** *********@****.***.*** www.linkedin.com/in/mgoswami1/ EDUCATION
Illinois Institute of Technology
Master of Science
Information Technology &
Management
GPA: 3.8/4.0 May’ 2021
Relevant Coursework:
Data Analytics
Advanced Database Management
Data Mining
Data Warehousing
Database Security
Object-Oriented App. Development
Object Oriented System Modelling
Rich Internet
Project Management
Uttar Pradesh Technical University
Bachelor of Science
Electronics & Communication
Engineering
GPA: 7.2/10.0 June’ 2014
TECHNICAL SKILLS
Data Science
Supervised, Unsupervised, Linear
Regression, Logistic Regression, KNN,
Naïve-Bayes, TensorFlow, Data Mining,
Decision Trees, Pandas, NumPy,
Matplotlib, Scikit learn, Machine
Learning, ANOVA, Random Forest,
SVM, Clustering, Scipy
Languages
R, Python, Java, HTML, XML
Database
Oracle11g, Oracle 12c, MySQL, MS
Access, SQL, PL-SQL, SQL Management
Studio
Tools, IDE, Servers
Tableau, Microsoft Power BI, R-Studio,
Visual Studio, SOAP UI, JIRA, Jenkins,
LINUX, QC, Visio, GitHub, MS Office,,
Microsoft Teams, SVN, Confluence,
Pentaho-ETL, Cognos, Hadoop, Hive,
AWS, HDFS, PySpark, Spark
LEADERSHIP
Facilitated product training to sales
team for Asian ODC for Newly
Developed Games at Aristocrat
Technologies
Delivered technical and product
implementation training to GlobalLogic
partners (Guidewire, ICW)
PROFESSIONAL EXPERIENCE
Data Engineer Intern, Quicken Loans Nov’2020- Present
● Created ETL processes to transform data & created database schemas to store, query and transform large data sets
● Monitored and troubleshoot performance and scalability issues in a large-scale data storage environment and migrated ETL processes from SQL Server Databases to AWS Aurora RDS.
● Created python script for identification of non referenced columns, Stored procedures, triggers & views across every DB server and database and dropping them from Data Warehouse improving the efficiency by 30-40%
Data Warehouse Intern, RTI International Jun’2020- Aug’ 2020
● Create ETL solutions for customers with continuous improvement of data accessibility and reliability
● Migrated production data from SQL Server to Hadoop using Sqoop and created hive queries for data extraction to publish dashboards in Power BI for better visibility on Delivery, Quality KPIs
● Designed SSIS Job Monitoring Power BI dashboard for the entire Data Warehouse of RTI that eased down the monitoring of Jobs effort by 93%
● Designed Cognos Reporting Dashboard for entire RTI that displayed historical as well as predicted trends of the reports generation by RTI that helped in managing resources and system work by 27%
Senior Analyst (Engineer II), Aristocrat Technologies Mar’ 2018 – Aug’ 2019
● Streamlined Casino Games (SDLC) following AGILE methodology from overall planning to deployment of the game software deliverables
● Implemented game modelling framework, business rules and business intelligence, ensured quality standards and risk management, report analysis and dashboard using Tableau as the front-end user interface
● Predicted game RTP by Linear regression model using R which involved large dataset cleaning
& pre-processing and presented insights to the team which reduced operating cost by 7.4%
● Strategize in developing new game ideas, POCs & business process modelling diagrams using Visio; enhanced system functionality by 33%
Data Engineer, GlobalLogic Oct’ 2015 – Mar’ 2018
● Reported trend metrics for large datasets through the creation of summarized analysis, clear, compelling infographics, data visualization in Tableau
● Systemized project releases using Kanban board dashboard on JIRA
● Facilitated in Poker Planning, sprint planning, SCRUM & RCA for the action items
● Developed Key-Word driven automation framework resulted in reducing manual efforts by 40%
● Accelerated 17% product sales by designing web scraping tool in Python to extract product usage data; summarized using Spark to correlate customer behavior Associate Engineer, Accenture Mar’ 2015 – Sept’ 2015
● Performed data analysis and cleansing by set operations & aggregate functions using advanced SQL queries, designed and developed pivot tables in MS Excel to improvise productivity & standardize reporting
● Published defect reports using Power BI creating ‘Triage Metrics’ dashboards for key clients
ACADEMIC PROJECTS
Google App Store Rating Prediction: (R, KNN, Logistic Regression) Built classification models like KNN & Logistic Regression by applying data transformation and selected the best model with highest accuracy of 80.2% using RMSE. Performed ANOVA hypothesis testing and identified “Games” category with high average mean Rating Ad-Click Prediction: (Python, Jupyter, Pandas, NumPy, Feature Tools, Matplotlib, Tableau, Scikit learn, confusion matrix) Performed several data preprocessing and transformations on the data set and to tackle data imbalance issue did over sampling using Smote and generated the confusion matrix and then built KNN, Decision Trees, Naïve-Bayes & Random Forest models and evaluated on the highest accuracy
Covid-19 Data Visualization: (ETL, Pentaho, Tableau) Created a Data Warehouse for multiple datasets (Covid-19, Government Response and Symptoms) using Pentaho, extracted and populated the data into fact table using advanced SQL queries to refine the data and linked the data to Tableau and created multiple time series visualizations, stories and dashboards