Atin Maiti
Tampa, FL ***** ac8o60@r.postjobfree.com 973-***-****
LinkedIn: https://www.linkedin.com/in/atin-maiti/
GitHub: https://github.com/AtinMaiti
Tableau: https://public.tableau.com/profile/atin4083#!/
Executive Summary
Data Scientist, having a profound understanding of data science lifecycle with expertise in building data pipelines in Python and R
Certified Tableau Desktop Specialist possessing excellent storytelling skills with proficiency in data acquisition and manipulation
Ability to leverage relational database systems (MS SQL Server, Oracle SQL), and no-SQL database systems (Hive, Cassandra)
Professional Experience
Data Science Intern, Nextech Systems LLC, Tampa, Florida May 2018 – Present
Developed a Predictive Model on patient appointment data to predict the chances of no-shows with the help of Machine Learning APIs by incorporating feature engineering, hyperparameter tuning, model selection, and created flask API for production use
Performed Quantitative and Qualitative analysis on market data with the help of statistical approaches to assist the company make an informed decision during business expansion using Python (Matplotlib, Seaborn) and Power BI
Devised a proof of concept model by training a Convolutional Neural Network to classify optical imaging data, the model will reduce the efforts to identify the type of imaging data efficiently when imported from external storage
Assistant System Engineer, Tata Consultancy Service Limited, Thane, India Aug 2015 - Jul 2017
Identified meaningful insights from large datasets and developed dashboard for tracking Key Performing Indexes using Tableau
Analyzed enhancement requests to examine its feasibility with business, fixed production bugs with SQL queries on Toad
Provided ad-hoc data analysis and create customized Salesforce reports for clients and supervisor
Education
MS in Business Analytics & Information System, The University of South Florida, GPA 3.71/4 Aug 2017 - May 2019
BS in Electronics and Telecommunication, The University of Mumbai, GPA 3.5/4 Aug 2011 - May 2015
Relevant Projects
High Risk Client Prediction - Czech Bank (Python) Fall 2018
Transformed bank’s relational database into a single DataFrame using Python, identified the target variable, performed Feature Engineering to train a binary classifier model to predict high-risk clients based on defaulter in loan payments
Used SMOTE analysis technique to defeat the imbalance in data and accomplished 90% accuracy after comparing several models
California Housing Price Analysis using Big Data Techniques (Hadoop, Spark, AWS S3) Spring 2018
Performed exploratory data analysis and predictive modelling on housing data to determine the house price and their interactions
Ingested data from data lake (AWS - Simple Standard Storage) into HDFS using PySpark - MLLib
Florida School Shooting, Sentiment Analysis, and Visualization (R and Tableau) Spring 2018
Extracted tweets and performed Sentiment Analysis of people's thoughts in their tweets by categorizing words by emotions
Major Insights: Emotion of fear was more prevalent than anger as these saddening and tragic events occur
Walmart Sales Prediction (R Programming) Fall 2017
Trained Multi-Level Ordinary Least Square model and ARIMA model to predict weekly sales of stores by department
Key finding: Significant and measurable effects on sales due to an interaction of promotion and holiday
Certifications
Inferential Statistics by Duke University on Coursera, certificate earned on 29-Jan-2019 Link
Digital: Bigdata and Hadoop Ecosystems Foundation, certificate earned on 15-Apr-2016 Link
Skills
Languages: Python, Spark (PySpark), Sqoop, Hive, Cassandra, SQL (T-SQL, PL/SQL), R, JSON, HTML, Java, SAS
Algorithms: Decision Tree, Random Forest, Naive Bayes, Neural Networks, K-NN, Linear and Logistic Regression, Support Vector Machine, Deep Learning, PCA, K- means Clustering, Hypothesis testing
Technologies: Tableau, Power BI, Weka, SAS Enterprise Miner, Microsoft Visual Studio, MS Excel, MS Visio, Salesforce, SSMS, GitHub, Amazon Web Services – (EC2, S3, RDS), Google Analytics, Databricks, SSIS
Techniques: A/B testing, ETL, web-scrapping, working with REST API