Exactly what it sounds like. Worth a note just because it keeps coming up.
Actually, you know what, the bias-variance decomposition is worth spelling out. Given a predictor $\hat{f}(x)$ and a target $y$, the expected squared error can be decomposed as

$$\mathbb{E}\big[(y - \hat{f}(x))^2\big] = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2} + \underbrace{\mathbb{E}\big[\big(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\big)^2\big]}_{\text{variance}} + \sigma^2,$$

where $f(x) = \mathbb{E}[y \mid x]$, the expectation is over training sets, and $\sigma^2$ is the irreducible noise in the data (i.e., given $x$, outcomes are drawn as $y = f(x) + \varepsilon$, where $\varepsilon$ has mean zero and variance $\sigma^2$).
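The proof is short enough to sketch: since $\varepsilon$ is mean-zero and independent of the fitted $\hat{f}$, the cross terms vanish when you expand the square:

$$\begin{aligned}
\mathbb{E}\big[(y - \hat{f})^2\big] &= \mathbb{E}\big[\big(\varepsilon + f - \hat{f}\big)^2\big] \\
&= \sigma^2 + \mathbb{E}\big[(f - \hat{f})^2\big] \\
&= \sigma^2 + \big(f - \mathbb{E}[\hat{f}]\big)^2 + \mathbb{E}\big[\big(\hat{f} - \mathbb{E}[\hat{f}]\big)^2\big],
\end{aligned}$$

where the last step adds and subtracts $\mathbb{E}[\hat{f}]$ inside the square.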
As you increase model complexity, bias typically decreases and variance increases. This is known as the bias-variance tradeoff. So to minimize squared error, you want to find an appropriate compromise between the two.
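You can watch the tradeoff happen in a few lines of simulation. The setup below is entirely my own choice (a sine ground truth, Gaussian noise, polynomial fits with NumPy), just to make the numbers concrete: fit polynomials of increasing degree to many fresh noisy training sets, then estimate bias² and variance on a test grid via Monte Carlo.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: ground truth f(x) = sin(2*pi*x), noise sigma = 0.3.
f = lambda x: np.sin(2 * np.pi * x)
sigma = 0.3
n_train, n_trials = 30, 500
x_test = np.linspace(0, 1, 50)  # grid where we measure bias/variance

for degree in [1, 3, 9]:
    preds = np.empty((n_trials, x_test.size))
    for t in range(n_trials):
        # Fresh noisy training set each trial.
        x = rng.uniform(0, 1, n_train)
        y = f(x) + rng.normal(0, sigma, n_train)
        coeffs = np.polyfit(x, y, degree)      # least-squares polynomial fit
        preds[t] = np.polyval(coeffs, x_test)  # predictions on the test grid

    bias_sq = np.mean((preds.mean(axis=0) - f(x_test)) ** 2)  # avg squared bias
    variance = np.mean(preds.var(axis=0))                     # avg variance
    print(f"degree {degree}: bias^2 = {bias_sq:.4f}, variance = {variance:.4f}")
```

Running this, the low-degree fits show large bias² and small variance, the high-degree fits flip that, and the sum (plus $\sigma^2$) bottoms out at an intermediate degree.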