A Bayesian (see Bayesian statistics) approach to nonparametric regression.
We say a stochastic process is a Gaussian process if, for any finite collection ,
where is a Mercer kernel and . Typically we suppose that since we can always subtract the mean from the data.
Gaussian process regression places a prior on the regression function which assumes that is a Gaussian process. Thus, assuming , the prior has density
where .
Predictive Distribution
Suppose we have observed training data and observe a new point . We have
if . Then and , so