A Bayesian (see Bayesian statistics) approach to nonparametric regression.

We say a stochastic process is a Gaussian process if, for any finite collection ,

where is a Mercer kernel and . Typically we suppose that since we can always subtract the mean from the data.

Gaussian process regression places a prior on the regression function which assumes that is a Gaussian process. Thus, assuming , the prior has density

where .

Predictive Distribution

Suppose we have observed training data and observe a new point . We have

if . Then and , so