Data Analyst

Toronto, ON, Canada
May 12, 2020

Sandhya Rani

Toronto, ON


Data Scientist with 3+ years of experience in collecting, analyzing and interpreting large datasets, developing new forecasting models and performing data visualization tasks.


Programming: Data Gathering / Data Cleaning / Data Mining / Descriptive Analysis / Modeling; Python: pandas, NumPy, scikit-learn, TensorFlow; Web Scraping: Python (Beautiful Soup)

Visualization: Python: matplotlib, seaborn; Power BI

Data Tools: SQL, Excel, Google Analytics, Python (Jupyter, PyCharm), RStudio

Modelling: Machine Learning Algorithms: Linear and Logistic Regression, Random Forests, Decision Trees

Statistics: Pearson’s correlation test, Chi-squared Test, D’Agostino’s K^2 Test, Analysis of Variance Test

Areas of Interest: Predictive Modelling, Recommendation Systems, Visualization, Fraud Analytics, Statistical Algorithms and Deep Learning

Soft Skills: Problem Framing, Leadership, Dependability, Motivation and Teamwork

Others: Testing, Agile methodology, SDLC, STLC


MHS Assessments (Leading developer of innovative scientific assessments) - Toronto, CAN

Data Scientist – Product Analysis and Management 03/19 – Present

Built logistic regression models to predict customer churn with an accuracy of 85%

Contributed to 10% of the company’s strategic growth in performance by analyzing previously unutilized historical data

Cleaned and analyzed datasets comprising nearly a million rows (structured and semi-structured) by performing EDA using Python packages such as pandas, matplotlib, SciPy and seaborn

Performed feature engineering to represent the underlying structure of the data more accurately by decomposing or splitting features, incorporating external data sources, and aggregating or combining features to create new ones and improve the model

Built specialized reports for the CS team to reach out to customized client profiles across the globe for client follow-up and marketing purposes

Created custom visual dashboards of customer views and transactional insights for the sales and marketing team using Power BI with cleaned data from a SQL database

Created data flow diagrams and process models using Visio

Created documentation for all the processes

Athena Global Technologies (Software development and consulting services) - India

Data Analyst – Research and Development 08/15 – 11/16

Developed a machine-learning-based logistic regression model to estimate the probability of fraud and other security threats for a leading bank

Cleaned customer data by performing exploratory data analysis and feature selection, and managed the data to reduce bias

Created data visualization dashboards for better understanding of the data using Power BI

Coordinated with a team of data scientists to review the successes and failures of the models, comparing precision, RMSE and other metrics such as ROC curves and confusion matrices across models, and to discuss ideas

KMAX IT Professionals (Software development and consulting services) - India

Testing Engineer – Software Engineering 06/12 – 06/15

Worked closely with software developers, project owners and BAs to develop and execute thorough test suites in all phases of the software development cycle

Developed the test strategy and test plan/design, executed test cases and managed defects for the ETL & BI systems

Analyzed the ETL workflows developed by the team

Performed data analytical testing for the BI systems

Validated data transformations and performed end-to-end data validation for the ETL & BI systems

Developed and executed detailed ETL-related functional, performance, integration and regression test cases, along with supporting documentation


Bachelor of Technology (2008 – 2012) – Electrical and Electronics Engineering, India

Master’s Degree (2017 – 2018) – Cloud Computing for Big Data, Canada

Certified IBM Data Science Professional – 2018 – Present

Springboard Data Science Career Track Certification

