Post Job Free

Resume

Sign in

Engineer Data

Location:
Detroit, MI
Posted:
February 06, 2021

Contact this candidate

Resume:

Mayank Goswami

Chicago, IL ***** 312-***-**** adjzc8@r.postjobfree.com www.linkedin.com/in/mgoswami1/ EDUCATION

Illinois Institute of Technology

Master of Science

Information Technology &

Management

GPA: 3.8/4.0 May’ 2021

Relevant Coursework:

Data Analytics

Advanced Database Management

Data Mining

Data Warehousing

Database Security

Object-Oriented App. Development

Object Oriented System Modelling

Rich Internet

Project Management

Uttar Pradesh Technical University

Bachelor of Science

Electronics & Communication

Engineering

GPA: 7.2/10.0 June’ 2014

TECHNICAL SKILLS

Data Science

Supervised, Unsupervised, Linear

Regression, Logistic Regression, KNN,

Naïve-Bayes, TensorFlow, Data Mining,

Decision Trees, Pandas, NumPy,

Matplotlib, Scikit learn, Machine

Learning, ANOVA, Random Forest,

SVM, Clustering, Scipy

Languages

R, Python, Java, HTML, XML

Database

Oracle11g, Oracle 12c, MySQL, MS

Access, SQL, PL-SQL, SQL Management

Studio

Tools, IDE, Servers

Tableau, Microsoft Power BI, R-Studio,

Visual Studio, SOAP UI, JIRA, Jenkins,

LINUX, QC, Visio, GitHub, MS Office,,

Microsoft Teams, SVN, Confluence,

Pentaho-ETL, Cognos, Hadoop, Hive,

AWS, HDFS, PySpark, Spark

LEADERSHIP

Facilitated product training to sales

team for Asian ODC for Newly

Developed Games at Aristocrat

Technologies

Delivered technical and product

implementation training to GlobalLogic

partners (Guidewire, ICW)

PROFESSIONAL EXPERIENCE

Data Engineer Intern, Quicken Loans Nov’2020- Present

● Created ETL processes to transform data & created database schemas to store, query and transform large data sets

● Monitored and troubleshoot performance and scalability issues in a large-scale data storage environment and migrated ETL processes from SQL Server Databases to AWS Aurora RDS.

● Created python script for identification of non referenced columns, Stored procedures, triggers & views across every DB server and database and dropping them from Data Warehouse improving the efficiency by 30-40%

Data Warehouse Intern, RTI International Jun’2020- Aug’ 2020

● Create ETL solutions for customers with continuous improvement of data accessibility and reliability

● Migrated production data from SQL Server to Hadoop using Sqoop and created hive queries for data extraction to publish dashboards in Power BI for better visibility on Delivery, Quality KPIs

● Designed SSIS Job Monitoring Power BI dashboard for the entire Data Warehouse of RTI that eased down the monitoring of Jobs effort by 93%

● Designed Cognos Reporting Dashboard for entire RTI that displayed historical as well as predicted trends of the reports generation by RTI that helped in managing resources and system work by 27%

Senior Analyst (Engineer II), Aristocrat Technologies Mar’ 2018 – Aug’ 2019

● Streamlined Casino Games (SDLC) following AGILE methodology from overall planning to deployment of the game software deliverables

● Implemented game modelling framework, business rules and business intelligence, ensured quality standards and risk management, report analysis and dashboard using Tableau as the front-end user interface

● Predicted game RTP by Linear regression model using R which involved large dataset cleaning

& pre-processing and presented insights to the team which reduced operating cost by 7.4%

● Strategize in developing new game ideas, POCs & business process modelling diagrams using Visio; enhanced system functionality by 33%

Data Engineer, GlobalLogic Oct’ 2015 – Mar’ 2018

● Reported trend metrics for large datasets through the creation of summarized analysis, clear, compelling infographics, data visualization in Tableau

● Systemized project releases using Kanban board dashboard on JIRA

● Facilitated in Poker Planning, sprint planning, SCRUM & RCA for the action items

● Developed Key-Word driven automation framework resulted in reducing manual efforts by 40%

● Accelerated 17% product sales by designing web scraping tool in Python to extract product usage data; summarized using Spark to correlate customer behavior Associate Engineer, Accenture Mar’ 2015 – Sept’ 2015

● Performed data analysis and cleansing by set operations & aggregate functions using advanced SQL queries, designed and developed pivot tables in MS Excel to improvise productivity & standardize reporting

● Published defect reports using Power BI creating ‘Triage Metrics’ dashboards for key clients

ACADEMIC PROJECTS

Google App Store Rating Prediction: (R, KNN, Logistic Regression) Built classification models like KNN & Logistic Regression by applying data transformation and selected the best model with highest accuracy of 80.2% using RMSE. Performed ANOVA hypothesis testing and identified “Games” category with high average mean Rating Ad-Click Prediction: (Python, Jupyter, Pandas, NumPy, Feature Tools, Matplotlib, Tableau, Scikit learn, confusion matrix) Performed several data preprocessing and transformations on the data set and to tackle data imbalance issue did over sampling using Smote and generated the confusion matrix and then built KNN, Decision Trees, Naïve-Bayes & Random Forest models and evaluated on the highest accuracy

Covid-19 Data Visualization: (ETL, Pentaho, Tableau) Created a Data Warehouse for multiple datasets (Covid-19, Government Response and Symptoms) using Pentaho, extracted and populated the data into fact table using advanced SQL queries to refine the data and linked the data to Tableau and created multiple time series visualizations, stories and dashboards



Contact this candidate