Sign in

Data Scientist/Developer

Atlanta, GA
March 28, 2018

Contact this candidate


Xiao Cheng

Email: Phone: 678-***-****

Statistical Analysis

Technical Skills

Other Qualifications


Data Mining


Chess Master

CAS Actuarial Exams

Regression Analysis


Strategic Thinker

Auto/Property Insurance

Machine Learning


Mandarin Chinese

Frequency/Severity Models

Big Data (Hadoop, Spark)


Quick Learner


Predicative Modeling


Team Player

Rate Making


Anthem Inc. August 2016 – Present

Hadoop Developer/Data Scientist Atlanta, GA

Perform Data Ingestion, ETL, and Analytics for Healthcare Program Integrity use case

Architecture Discussion from POC to Production: Converting HQL files to ETL workflow

Optimize Impala/Hive/Spark code to reduce Production run time by 30%

Improve QA steps thru automation tests and eliminate 80% of manual tests

Reduce Tableau visualization performance by 33% from back-end code changes

Build and manage BitBucket Repository to improve the collaboration between teams

Automate Shell scripts to simplify ETL process and eliminate errors

Provide recommendations to business stakeholders in cost saving by identify target cases

Facilitate discussions to implement IT SDLC cycle for analytical models development

Teradata Corporation March 2014 – August 2016

Data Scientist Consultant (Advanced Analytics) Atlanta, GA

Telecommunication, Financial Services, and Hospitality

Identified over 5% of churn risks by applying behavior patterns

Provide text analytics and captured 40% of customer un-satisfaction thru text analysis

Implemented code to capture customer plan changes from monthly to daily view

Build machine learning models and studied correlation effects for customer surveys

Designed the process flow to pull Twitter data using R and Aster

Used kmeans analysis for customer segmentations and behaviors

Created decision tree/GLM models and interpreted model results for business users

Provide ETL and path analysis on Internet of Things device for health care data

Statistics Master’s Thesis Athens, GA January – November 2013

Performed machine learning analysis including CART, Random Forest, and Boosting in R

Worked on a data set that predicts the length of stay in hospital for patients based on claim records


Teaching chess in-person and online to bring more interest to the game

Help students increase US Chess rating 500-1500 points

2006 Georgia State Chess Champion; 2005 Supernationals K-12 National Chess Champion


Zurich Insurance Actuarial Intern Schaumburg, IL May – August 2013

Analyzed correlations in SAS between existing experience and related model variables

Passed Actuarial Exams Probability/P1, Financial Mathematics/FM2, and Financial Economics/MFE3


Master of Science in Statistics; University of Georgia December, 2013

Overall GPA: 3.54/4.00. The UGA Honors Program, Dean’s List, and HOPE Scholarship Athens, GA

Contact this candidate