GAGAN M BABU
Seaborn, R, Dplyr,
ggplot2, SQL, SQL
BE in Industrial
Production @ NIE,
Mysore with CGPA
Diploma in @ JSS
Mysore with 73.5%
Diploma in Data
Python and R @
for Beginners by
Academy - Google
Analytics Academy -
Looking for a suitable position as ML Engineer like Data Analyst where I can utilize my technical skills for the company’s growth. As a Data Analyst, the following responsibilities are performed:
Interpret data, analyze results using statistical techniques
Translate analysis and insights into ongoing management and technical reports
Develop and implement data collection and data
analytics strategies that optimizes statistical efficiency and quality
Identify, analyze, and interpret trends or patterns in complex data sets
Filter and “clean” data by reviewing compute reports and Performance indicators to locate and correct data quality issues
Locate and define new process improvement opportunities
Thrive in a team environment with strong interpersonal skills. Collaborate and build relationships with product owners, engineers, development teams, architects, operations partners, and business clients
Establish, and regularly update, multi-phase delivery roadmap
Technical expertise regarding data models, data mining and segmentation techniques
Knowledge of statistics and experience using statistical packages for analyzing datasets (Excel)
Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy
Adept at queries, report writing and presenting findings Personal Skills:
Attention to details
Proven ability in developing relationships with stakeholders, communicating project/program status, and understanding detailed business requirements across multiple project initiative
Attended an online training and got certified in Google Analytics tool.
Currently working @ Jubilant Generics Limited, Nanjangud as Associate Purchase from March 2018 to till date. Jubilant Generics Limited is an integrated global pharmaceutical and life sciences company engaged in manufacturing and supply of APIs, Solid Dosage Formulations, Radiopharmaceuticals, Allergy Therapy Products, Advance Intermediates, Fine Ingredients, Crop Science Ingredients, Life Science Chemicals and Nutritional Products.
Ensuring smooth work flow in the supply chain from end to end
Good skills and knowledge of SAP (Systems Applications and Products in Data Processing)
Minimizing inventory costs
Planning supply chain schedules in advance in preparation for periods
Negotiating with suppliers to minimize raw materials and achieve maximum efficiency
Providing accurate routing information to ensure that delivery times and locations are coordinate
Problem solving skills & supporting team with necessary information.
Ensuring Project closure with necessary details & submit to the management for capitalization.
Worked @ Unilog Content Solutions Pvt. Ltd., Mysore as Data Analyst from January 2016 to December 2017.
Unilog Content Solutions Pvt. Ltd is a B2B eCommerce software and content management. all-in-one, multitenant SaaS B2B digital commerce platform built with an API first/micro services architecture and a platform with strong site search, PIM, and user roles and B2B workflows-built in. Major Projects handled while working in Unilog Content Solutions Pvt. Ltd. Company:
Project – Russel Equipment’s, Villa Lighting, S&S, Orgill Software Platform used: Advanced MS-Excel with features like VLOOKUP, Pivot Tables and etc.
Data Enrichment by Collecting all Possible Critical Information by Web Search.
Production, Quality Check, Quality Analysis
List of the projects practiced during the Data Analyst training:
1. Iris Dataset:
This data set consists of the physical parameters of three species of flower — Versicolor, Setosa and Virginica.
In this data we will be predicting the classes of the flowers based on numeric parameters that are Sepal width, Sepal length, Petal width and Petal length.
NumPy, Pandas and Scikit Learn are some of the inbuilt libraries in Python that we have used.
Using algorithms, we have trained our model to check how accurate every algorithm
K – Nearest Neighbour (KNN), Logical Regression,
Random forest, SVM (Support Vector Machine)
Here, Random forest gives optimal accuracy compared to others
By using this we reduce the chances of overfitting and variance in the data which thus lead to better accuracy
2. House Price Prediction:
Thousands of houses are sold every day. There are some questions every buyer asks himself like: What is the actual price that this house
This data contains 1460 training data points & 80 features that might help us predict the selling price of a house.
To apply data preprocessing and preparation
Exploratory data analysis allows us to understand the data and the relationships between variables
Correlation between variables helps us to predictor variables are correlated with the target variable
Build machine learning models able to predict house price based on Algorithms: - Linear Regression,
Nearest Neighbors, Support Vector Regression,
Decision Tress, Neural Networks, Random Forest
Choose an algorithm that implements the
To analyze and compare models’ performance in
order to choose the best model
1. Movie Recommendation System:
To build a recommendation engine that
recommends movies to users
A recommendation system provides suggestions to
the users through a filtering process that is based on user preferences and browsing history.
In our Data Science project, we will make use of these four packages – ‘recommenderlab’,
‘ggplot2’, ‘data.table’ and ‘reshape2’.
Data pre-Processing will help to make the finalized dataset to build the model by using One-hot encoding
We will implement a single model– Item Based
Explore the most viewed movies in our dataset
Data Normalization is a data preparation procedure to standardize the numerical values in a column to a
common scale value
The algorithm first builds a similar-items table of the customers who have purchased them into a
combination of similar items. This is then fed into the recommendation system