Anirudh Mittal
Data Scientist
Email/Skype: ******************@*****.*** / manirudh2803
Phone: 774-***-****
Summary:
8+ of IT experience and Analytics experience in data analysis, strategic analysis, Predictive Analytics and Inferential statistics, data mining and data visualization to provide analytical solutions to client's business problems.
Experience in working as a team to develop and deliver enterprise-wide solutions for complex data challenges.
Experience in developing and applying methods by collecting, processing and analyzing large volumes of data to build and enhance products, processes and systems.
Experience with Python, R, Java, Scala applied to programming access and manipulation of data.
Experience in machine learning in commercial and finance domain.
Experienced in Machine Learning techniques like clustering, decision tree learning, artificial neural networks and their real time advantages and drawbacks.
Expertise in math, statistics and quantitative analysis. Used analytic techniques such as classification, regression, similarity matching, clustering, co-occurrence grouping, profiling like prediction, data reduction, and casual modeling.
Experience in developing and deploying an automated process.
Strong knowledge of state-of-art ML algorithms like Bayesian learning, latent/topic modelsapproximates inference, deep networks, and stochastic processes.
Experience in deep learning techniques for NLP like test classification, language modeling, question answering.
Experienced in Machine Learning Classification Algorithms like Logistic Regression, K-NN, SVM,Kernel SVM, Naive Bayes, Decision Tree and Random Forest classification.
Experience and expertise in machine learning based predictive modeling projects with machine learning models like Gradient Boosting, Collaborative filtering, Bayesian Methods, Random Forest, SVM, Markov Models.
Experienced in advanced statistical analysis and predictive modeling in structured and unstructured dataenvironment.
Good knowledge of Linear Algebra, random process and statistical analysis.
Experience in translating business and product questions into analytics projects.
Experiences with high velocity data like click streams.
Good understanding of optimization techniques like first order, second order optimization, gradient descent etc.
Hands on experience of Data Science libraries in Python such as Pandas, NumPy, SciPy, neurolab, NLTK.
Hands on experience on R packages and libraries like ggplot2, h2o, dplyr, plotly, RMarkdown etc.
Experience in various phases of Software Development life cycle (Analysis, Requirements gathering, Designing) with expertise in writing/documenting Technical Design Document (TDD), Functional Specification Document (FSD), Test Plans, GAP Analysis and Source to Target mapping documents.
Experience in Deep Learning models using Theano, Tensorflow, scikit.learn packages using Python.
Excellent understanding of Hadoop architecture and Map Reduce concepts and HDFS Framework.
Strong understanding of project life cycle and SDLC methodologies including RUP, RAD, Waterfall and Agile.
Good understanding in ETL, Data warehousing, Data Marts, OLAP and OLTPtechnologies.
Experience working on data visualization tools (Tableau, matplotlib) and graphical analysis techniques like community detection, random walk.
Professional Experience
AT&T, Seattle, WA
Role: Data Scientist
August 2015 to Current
Objective: Modelling to determine best customer experience against competitors like Xfinity to target customers for personalized marketing strategies. AT&T supports sales Online (Ecommerce) and other sales channels (Retail stores, POS, OPUS).
Responsibilities:
•Working as a part of Sales Catalog Data Management Team.
•Project involves integrating Wireline (EDW) and Wireless (eCDW) data.
•Closely work with the IT, Sales and Marketing leads to formulate hypothetical insights, predictive analysis and work through delivering these insights to production.
•Closely work in providing data for the TDATA personalisation engine for better and advanced customer experience while shopping online.
•Personalization for registered users, Returning visitors and Anonymous visitors.
•Worked in gathering the required events (System Events, User Events) for reporting to provide a better application GUI.
•Inflection points tracking, Target Marketing, Customer profiling and Segmentation.
•Worked on model building using machine-learning algorithms like logistic regression, naive bayes, random forest, SVM, SVR.
•Utilized pandas, numpy, seaborn, scipy, matplotlib, scikit-learn in Python for developing various machine-learning algorithms.
•Worked on a POC project for NLP (Natural Language processing) with Review Data.
•Propensity to convert modelling using the visit sequence and conversion sequence mapping.
•Zeroth problem solution for Anonymous visitor, returning visitor and registered users
•Perform text analytics for the CHAT application via review data machine learning technique in python using NLTK to improve better recommendations for the Reps when advising a customer.
•Used UCB & Thomson Sampling Intuition for Multi Bandit Testing and A/B testing to perform ad banner choices and product visual choices to support customer from different geographic locations and off different classes (Student, Employee, IRU’s, and CRUS’s).
•Market basket Analysis using APRIOR for purchase transactions data at store level to recommend customers with the different combinations of the products like better internet speed with the best DIRECT TV packages as a bundle. Custom Dashboards product category wise.
•Sequence mining for the conversion paths for different segments.
•Profiling based on the content access reports, event trigger reports and navigation reports. Profiles are then used for target marketing and segmentation.
•Custom Website strategy for segments.
•Site/Page optimization, Promotional campaigns assessment, Customer segmentation and assessing revenue against targets at regular intervals.
•Developed visualizations using sets, Parameters, Calculated Fields, Dynamic sorting, Filtering, Parameter driven analysis, gathered data from different data marts.
•Reporting designs based on the business specific problems, Reporting implementation on Tableau to suggest business ways to improve their customer experience both in terms of sales and user interface.
•Advanced charts, drill downs and intractability are incorporated in the reporting for different stakeholders and integrating the publishing of reports to the clients SharePoint infrastructure.
Environment: Python, SQL, TDP, UML, Mac, Seaborn, ggplot, Tableau, Postman, HPQC, Anaconda, Jupiter notebook, Visio.
State Farm, Bloomington, IL
Role:Data Scientist
Jan 2012 to May 2015
Objective: Modelling Individual and Family Insurance rate based on Income, Demography and competitors strategy for Market campaigning.
Responsibilities:
•Performed predictive analysis and developed models to provide insight to business on the rates for the insurance plans based on several parameters (Demography, Competitors market, different Customer classes).
•Participated in all phases of data mining- data collection, data cleaning, developing models, validation, and visualisation.
•Performed Gap analysis to detail the performance of the application against the business requirements.
•Providing Ad hoc analysis for statistical modelling and analytic reports to answer specific business query.
•Captured Modelling requirements from Senior Stakeholders to Bench functional requirements for SAS/ R Python
•Performed Data Manipulation and Aggregation from Various source including HDFS.
•Creating various B2B Predictive and descriptive analytics using R and Tableau to improve the experience of Small Business Insurance Plans.
•Used pandas, numpy, seaborn, scipy, matplotlib, scikit-learn in Python for developing various machine-learning algorithms.
•Designed and tested Predictive Algorithms using Historical Data to predict better future prices and promotional campaigns.
•Utilized machine learning algorithms such as Decision Tree, linear regression, multivariate regression, Naive Bayes, Random Forests, K-means, & KNN to understand the sales of various insurance plans and also to get an insight of claims based on different conditions like demography, customer income, natural disasters.
•Parsing data, producing concise conclusions from raw data in a clean, well-structured and easily maintainable format.
•Responsible for Big data initiatives and engagement including analysis, brainstorming, POC, and architecture.
•Worked on different data formats from different vendor systems with data formats like JSON, XML.
•Worked on Map Reduce/Spark Python modules for machine learning& predictive analytics in Hadoop AWS.
•Worked with (Tableau) Report Writers to Test, Validate Data Integrity of Reports
Environment: Anaconda Distribution, Jupiter notebook, Pandas, Visio, Excel, R Studio, TDP, Rally, Windows.
Accenture Private limited, Peoria, IL
Role:Modeling Analyst
Aug’ 10 – Dec’ 2011
Objective: Modelling Individual and Family Insurance rate based on Income and Demography for Market campaigning.
Responsibilities:
Prepared financials reports, analysis of month on month variance and provided reports to management on a weekly, monthly and quarterly basis
Scrutiny of General Ledger on monthly basis and do a financial analysis of the same. Provide the information in an appropriate manner to the management
Monitored costs & trend analysis and providing updates on variances to respective stakeholders
Prepared annual budget and periodic forecast/outlook and Generating Insights
Partnered with Leadership to understand the requirement and provide reports, Analyze business units
Recommended solutions for areas of concern with financial data.
Provided ad-hoc reports as per requirement from time to time
Environment: Visio, SQL server, Excel, PowerPoint, Tableau, HPQC, Visio.
Technical Skills
Languages
Python, R, Scala, PIG, MATLAB, Java, C, C++, SQL, UML
Data Warehousing Tools
Informatica, SAP BW/HANA, Datastage
Testing Tools
TDP, IPB Rational Clear Quest, Jira, Rally
Big Data Tools
Hadoop Stack, Apache Spark
Reporting & Visualization
Tableau, Matplotlib, Seaborn, ggplot
Databases
Oracle, MySQL, NoSql, Couchbase, Cassandra
OS
Unix, Linux, Windows, Mac
Education
Masters in Computer Science Engineering at St Cloud State University.
Bachelors in Computer Science Engineering at OU, Hyderabad.