Skip to content

syrte/robustgp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

robustgp

Robust Gaussian Process Regression Based on Iterative Trimming (ITGP)

  • Nonparametric, flexible, and robust nonlinear regression based on Gaussian process.
  • Outperforming the robust Gaussian process with Student’s-t likelihood significantly in many test cases.

The Gaussian process (GP) regression can be severely biased when the data are contaminated by outliers. ITGP is a new robust GP regression algorithm that iteratively trims the most extreme data points. While the new algorithm retains the attractive properties of the standard GP as a nonparametric and flexible regression method, it can greatly improve the model accuracy for contaminated data even in the presence of extreme or abundant outliers. It is also easier to implement compared with previous robust GP variants that rely on approximate inference. Applied to a wide range of experiments with different contamination levels, the proposed method significantly outperforms the standard GP and the popular robust GP variant with the Student-t likelihood in most test cases.

Install

pip install robustgp

Dependency: GPy

Quick start

One can start with examples in this notebook.

Usage

from robustgp import ITGP

# train ITGP
res = ITGP(X, Y, alpha1=0.5, alpha2=0.975, nsh=2, ncc=2, nrw=1)
gp, consistency = res.gp, res.consistency

# make prediction
y_avg, y_var = gp.predict(x_new)
y_var *= consistency

Here gp is a GPy.core.GP object, whose usage is further illustrated here.

Benchmark

Comparison with standard Gaussian process and t-likelihood Gaussian process in terms of RMSE. See the algorithm paper for details.

Benchmark

References

License

The MIT License