Multiple years of IT experience and Analytics experience in data analysis, strategic analysis, Predictive Analytics and Inferential statistics, data mining and data visualization to provide analytical solutions to client's business problems.
Strong foundation in Mathematics, Statistics, Machine Learning and Data Mining techniques.
Proficient in managing entire data science project life cycle and actively involved in all the phases of project life cycle including
Data acquisition (sampling methods: SRS/stratified/cluster/systematic/multistage)
Power Analysis, Hypothesis testing, effect size
EDA (Univariate & Multivariate analysis)
Data Imputation (outlier detection via chi sq detection, residual analysis, PCA analysis, multivariate outlier detection)
Statistical modeling both linear and non-linear (logistic, linear, Naïve Bayes, decision trees, Random forest, neural networks, SVM, clustering, KNN)
Dimensionality reduction using Principal Component Analysis (PCA) and Factor Analysis, testing and validation using ROC plot, K- fold cross validation, statistical significance testing.
Worked on Reinforcement Learning Techniques like Multi Armed Bandit (UCB & Thompson Sampling) A/B testing.
Experience in Data Analysis, obtain insights from data then choose appropriate Machine Learning/Data Mining algorithms (Functional/classification/cluster models etc) using cost functions (gradient/normal and stochastic)
Perform model validation (AIC/BIC/APE/MAPE, confusion matrix, ROC curve, ks curve, decile analysis, Gini coefficient)
Experience with various application like SAS, SPSS, R(packages- knitr,dplyr, tidyr, SparkR, causallnfer, spacetime), Python (sklearn/ scipy/numpy/panda).
Job Title: Data Scientist
Location: Atlanta, GA
Duration: Sep’15 – Current
Objective: Modelling to determine Internet rates and plans against competitors like Xfinity to target customers for personalized marketing strategies.
•Personalization, Inflection points tracking, Target Marketing, Customer profiling and Segmentation
•Personalization for registered users, Returning visitors and Anonymous visitors
•Worked on model building using machine learning algorithms like logistic regression, naive bayes, random forest, SVM, SVR.
•Worked on a POC project for NLP(Natural Language processing) with Review Data.
•Propensity to convert modelling using the visit sequence and conversion sequence mapping
•Zeroth problem solution for Anonymous visitor, returning visitor and registered users
•Text analytics on review data machine learning technique in python using NLTK.
•Market basket Analysis using APRIOR for purchase transactions data at store level.
•Sequence mining for the conversion paths for different segments
•Profiling based on the content access reports, event trigger reports and navigation reports. Profiles are then used for target marketing and segmentation
•Custom Website strategy for segments.
•Site/Page optimization, Promotional campaigns assessment, Customer segmentation and assessing revenue against targets at regular intervals
•Used UCB & Thomson Sampling Intuition for Multi Bandit Testing and A/B testing to perform ad banner choices and product visual choices optimization.
•Custom Dashboards product category wise.
•Developed visualizations using sets, Parameters, Calculated Fields, Dynamic sorting, Filtering, Parameter driven analysis, gathered data from different data marts.
•Reporting designs based on the business specific problems, Reporting implementation on Tableau.
•Advanced charts, drill downs and intractability are incorporated in the reporting for different stakeholders and integrating the publishing of reports to the clients SharePoint infrastructure.
Client: State Farm
Job Title: Data Scientist
Location: Bloomington, IL
Duration: Jan’12 –Aug’15
Objective: Modelling Individual and Family Insurance rate based on Income and Demography for Market campaigning.
•Participated in all phases of data mining- data collection, data cleaning, developing models, validation, visualization and performed Gap analysis.
•Providing Ad hoc analysis and reports to Executive level management team.
•Captured Modelling requirements from Senior Stakeholders to Bench functional requirements for SAS/ R Python
•Performed Data Manipulation and Aggregation from Various source including HDFS.
•Creating various B2B Predictive and descriptive analytics using R and Tableau
•Used pandas, numpy, seaborn, scipy, matplotlib, scikit-learn in Python for developing various machine learning algorithms.
•Designed and tested Predictive Algorithms using Historical Data
•Utilized machine learning algorithms such as Decision Tree, linear regression, multivariate regression, Naive Bayes, Random Forests, K-means, & KNN.
•Parsing data, producing concise conclusions from raw data in a clean, well-structured and easily maintainable format.
•Responsible for Big data initiatives and engagement including analysis, brainstorming, POC, and architecture.
•Worked on different data formats such as JSON, XML and performed machine learning algorithms in R
•Worked on Map Reduce/Spark Python modules for machine learning & predictive analytics in Hadoop on AWS.
•Worked with (Tableau) Report Writers to Test, Validate Data Integrity of Reports
Client: Accenture Private limited
Job Title: Modeling Analyst
Location: Peoria, IL
Duration: Aug’ 10 – Dec’ 2011
Objective: Modelling Individual and Family Insurance rate based on Income and Demography for Market campaigning
Prepared financials reports, analysis of month on month variance and provided reports to management on a weekly, monthly and quarterly basis
Scrutiny of General Ledger on monthly basis and do a financial analysis of the same. Provide the information in an appropriate manner to the management
Monitored costs & trend analysis and providing updates on variances to respective stakeholders
Prepared annual budget and periodic forecast/outlook and Generating Insights
Partnered with Leadership to understand the requirement and provide reports, Analyze business units
Recommended solutions for areas of concern with financial data.
Provided ad-hoc reports as per requirement from time to time