R, SAS, SQL, Spark, Hadoop, Python, Java, Hive

Location:
Chicago, IL
Posted:
November 26, 2015


Resume:

CAREER SUMMARY

Seven years of experience developing advanced analytical toolkits and algorithms using SAS and R;

Hands-on programming experience in Java, Python, C++, Hadoop and big data modeling;

Excel in data manipulation using SQL, Hive, Teradata and driving analyses to actionable conclusions;

Hands-on experience in statistical analysis, such as generalized linear models and forecasting methods;

Familiar with machine learning methods such as SVM, random forests, decision trees and neural networks;

Knowledge of marketing data analysis such as CRM, ROI, and campaign testing and validation;

Strong communication skills gained from leading five projects and from consulting and teaching experience.

EDUCATION

Ph.D. in Civil Engineering (specialties: data mining and spatial statistics) 5/2012

University of Illinois at Urbana-Champaign, GPA: 3.8/4.0

M.S. in Environmental Science, with honors 6/2004

Peking University, Beijing, China, GPA: 3.7/4.0

B.S. in Environmental Engineering, with honors 6/2001

Wuhan University, Hubei, China, GPA: 3.4/4.0

EMPLOYMENT

Data and Information Specialist, University of Illinois at Urbana-Champaign, Full-time 4/2015-present

Establish a library of machine learning and parallel computing methods for the financial engineering department.

Create an anti-money-laundering detection model and improve the risk rating system using SQL, Python and Hive.

Implement large-scale numerical models using Hadoop, Spark and MLlib and earn a cloud computing badge.

Develop a stochastic pricing model for asset management and implement Monte Carlo simulation.

Supervise eight practicum projects, provide technical support and update sponsors on progress weekly.

Instruct sixty graduate students in scripting with SQL and Python and in statistical packages such as SAS and R.

Sr. Data Scientist, FedEx Services, Full-time 10/2013-3/2015

Architected and implemented three analytical toolkits for timely and accurate statistical analysis using SAS.

Designed and led development of big data algorithms using Hadoop and Java, improving forecasting accuracy by 18%.

Created cron jobs in Unix and streamlined data aggregation from the data warehouse using SQL and Teradata.

Processed data with Hive and Teradata, and developed web applications using Java and Oracle SQL.

Presented weekly strategic initiatives to the VP using Tableau, covering the top twenty clients and actionable conclusions.

Modeler/Programmer, CB&I/The Shaw Group, Full-time 8/2012-9/2013

Developed Monte Carlo simulation of biodegradation using Java and helped the client quantify risks with R.

Implemented customer segmentation and provided analytical expertise for customer retention and cross-selling.

Programmed a data interpolation toolkit using C/C++ and processed large datasets using Python and SQL.

Analyzed three million potential sites using k-means clustering for large-scale site selection under uncertainty.

Applied Bayesian statistical toolkits to increase prediction accuracy by 24% despite insufficient historical data.

Presented the progress of five projects to clients weekly and coordinated three projects with six team members.

Statistical Intern, State Farm, 10 hours/week for 1/2012-5/2012 and 20 hours/week for 6/2012-8/2012

Wrote SAS programs for statistical analysis of insurance data, such as multivariate regression.

Implemented customer segmentation and provided analytical expertise for customer retention and cross-selling.

Presented findings of insurance claim analysis to the director and managers of the underwriting department.

Research Assistant/Computer Lab Assistant, University of Illinois, 20 hours/week 8/2007-12/2011

Programmed decision trees to cross-analyze model results and quantified goodness of fit with the Akaike information criterion (AIC).

Developed a two-dimensional statistical toolkit based on a 2-D moving average method, saving 80% of the time required.

Developed a new fast method for large-scale spatial optimization that cut computation time by 60%.

Taught a weekly computer lab, managed progress on five exercises and set reasonable milestones for forty students.

AWARDS & HONORS

ESRI-GIS Development Center Software Development Award at the University of Illinois 2011

Outstanding student paper in Earth Informatics at the American Geophysical Union Fall Meeting 2007

Best student poster at the Illinois Water conference 2006

PUBLICATIONS & PRESENTATIONS

I have published ten papers on large-scale computer-aided engineering, optimization under uncertainty and software development. More details will be given upon request.

I have demonstrated software I developed and orally presented eleven conference abstracts, including international award winners and an invited talk at the INFORMS conference in 2011. More details will be given upon request.


