Sign in


Naperville, Illinois, United States
January 05, 2018

Contact this candidate



Email:, Mobile: +1-785-***-****

Availability: Immediate

Profile of Qualifications

Data extraction, Data cleaning, Data mining, Data visualization and Predictive modeling.

Expert in SQL and Advanced Excel (VLOOKUP, Pivot Tables, VBA).

Proficient in setting up Linux OS and commands.

Strong understanding of Data warehouse ETL Kettle Transformations, Dimensional star schema (Pentaho).

Strong Knowledge on Google Analytics and Google AdWords.

Well versed in machine learning algorithms such as Linear and Logistic Regression, Time Series Analysis, Decision Trees, Random Forest, Support Vector Machines, K nearest neighbors, Naïve Bayes and Apriori.

Hands on experience in conducting statistical analysis and data mining activities using SPSS modeler.

Strong knowledge on KPI’s involved in CRM applications to perform data analysis.

Strong skills performing data analysis, impact analysis, risks analysis, writing test cases for User Acceptance Testing. Gathered Functional and Data Requirements, analyzed workflows and created Use Cases, Requirement Specifications, Report Specifications, Data Requirements, Data Mappings and Data Flow Diagrams.

Strong knowledge and demonstrable understanding on QA procedures, methods and techniques. Experience in manual and automation testing (HP QTP)


Illinois Institute of Technology, Chicago, IL May 2017

Master of Information Technology Management GPA 3.8

V R Siddhartha Engineering College, Vijayawada, Andhra Pradesh, India May 2011

Bachelors of Technology GPA 3.5

Technical Skills

Programming Languages: JAVA, SQL, PL/SQL, VBA, Linux, R and SAS/BASE/Macro/SQL/XML, Python

Microsoft: Access, Excel, PowerPoint, Word and Visio

Databases: Oracle 11g, Oracle 12c, MY SQL, MS, Access, Microsoft SQL server, Excel

Tools: JIRA, HP QC, HP QTP, SQL Developer, Toad, Tableau, Pentaho, SAS, SPSS, SSIS, SSRS

Methodologies: SDLC, STLC, ITIL, Agile/Scrum


SAS Certified Base Programmer, License Number: BP068052v9

SAS EBI certification for Experience users

Google Analytics Certified, License Number: 998-***-****

DataCamp Certified Intermediate to R, License Number: 0bf086a0ceb9cc96a0acd533cb8db9fb6071e802


Statistician – CES LLC (Client NCI AdvanceMED) May 2017 - Present

Communicate effectively with internal and external customers, including federal law enforcement officers.

Validate data analysis results and analytically identifies potential fraud, waste and/or abuse situations in violation of Medicare/Medicaid laws, guidelines, policies, and regulations.

Supports management requests for CMS ZPIC Zone 2 reporting requirements.

Utilizes data analysis techniques to detect aberrancies in Medicare/Medicaid claims data and proactively seeks out and develops leads and cases received from a variety of sources including CMS and OIG, fraud alerts, and referrals from government and private sources.

Work with Statisticians and Sr. Data Analysts to provide proactive data analysis results with statistically high probabilities of producing case referrals to law enforcement, overpayments, and/or administrative actions.

Prepare, develop and participate in provider, beneficiary, law enforcement, or staff training as related to Medicare fraud, waste and/or abuse data analysis.

Advanced knowledge of health care data (e.g., CPT/HCPS, ICD9/ICD10).

Comply with and maintain various documentation and other reporting requirements as needed.

Tools Used: SAS, R, Excel, VBA

Sr Engineer/Data Analyst - IGATE GLOBAL SOLUTIONS Sep 2011 – July 2015

Reading data from different sources like .csv, excel, MS Access, tab delimited files and Oracle databases.

Analyzed data on Medicaid Fraud such as finding duplicate claims, service after death, or multiple service providers.

Led management through detailed analysis of production results, compared with external healthcare trends on utilization and cost of healthcare.

Analyzed data related to inpatient/outpatient facility claims and professional claims.

Analyze provider reimbursements to identify strategies to improve access and profitability in a manner that is consistent with quality, cost effective care.

Experience in creating the tables and sequences for the experimental data load capture. Loaded the data into the tables using TOAD and SQL*Plus. Created metadata validation lookup tables and pre-populated them using SQL Loader generator application.

MS Excel Pivot Tables, nested arrays, logical formulas, macros, VBA and dashboard reporting to internal and external head management; used for lead decision making reporting.

Developed management reports and assisted in presenting important issues to management in the form of presentations using Tableau.

Daily preparation and presentations for C-level management on weekly, monthly and yearly production numbers, with dynamic views in MS PowerPoint and MS Excel

Work with statistician to analyze the results obtained from various statistical procedures like PROC ANOVA, GLM, and T-test.

Academic Projects

Advanced Topics in Data Management: Database Model Nov - Dec 2015

Designed a Conceptual/Logical Database model for an online store using SQL.

Created dashboards in targeting profitable revenue market through analyzing data on client purchases.

Object Oriented App Development: Web Application Nov - Dec 2015

Created a Web application for online car showroom to tend to client’s specifications for point of sale.

Developed the application by creating MVC architecture.

Data Analytics: R and SAS (Text Analytics) Feb - May 2016

Extracted the Executive Compensation Data, Q&A section from DEF 14A SEC filings of Ford Motors.

Identify key performance indicators and predict the major decisions and outcomes in the organization.

Identifying tax strategies for the states of USA based on household incomes, property values, Workers in Family.

Data Mart Application – Kimball Approach Feb - May 2016

Create a database using MYSQL and dimensional star schema with the help of role playing dimensions, junk dimensions and bridge tables.

Implemented ETL kettle transformations using Pentaho.

Analysis and data visualization reports were created using Tableau from which conclusions are drawn.

Data Mining: Association rule Mining using R Sep – Oct 2016

Predicted what sorts of people are likely to survive on one of the most infamous shipwrecks in history TITANIC by applying Apriori algorithm. Created a decision tree visualization and derived the reports using arulesViz package, scatter plots.

Participated in kaggle competition over the same challenge.

Recommendation Engine – Predictive analysis on airlines Oct – Dec 2016

Analyze the data to find best attributes suited for the algorithms and pre-process accordingly to make the model fit.

Implemented KNN and Naïve Bayes algorithms using R to predict the airline delay and create reports using R statistical packages. Accuracy of the models is calculated using Cross table validation to find the best suited model for prediction.

Implemented the logistic regression(scikit-learn) analysis on the same project in python.

Time Series Analysis – Illinois unemployment rate Feb – Apr 2017

Developed R scripts, cleaned and de-trended the unemployment time series data, eliminated non- stationarity by applying transformation to reduce it to a model-able form.

Successfully selected the best fit SARIMA model by tuning parameters to find the minimum AIC, BIC and MSE, perfectly fitted the data with very small error and adequately forecast future.

Estimated and forecasted the data with Spectral Model Analysis and VAR model.

Contact this candidate