An extremely general, and widely used, framework for how to pick a good model in learning theory. Suppose we have some set of models $\mathcal{H}$ (a hypothesis class). Given training data $(x_1, y_1), \dots, (x_n, y_n)$, a natural way to choose a predictor is to minimize the empirical risk:

$$\hat{f} \in \operatorname*{arg\,min}_{f \in \mathcal{H}} \hat{R}(f),$$

where $\hat{R}(f) = \frac{1}{n} \sum_{i=1}^{n} \ell(f(x_i), y_i)$ is the empirical risk (see statistical decision theory).
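As a sketch of what ERM looks like concretely, here is a toy example over a finite class of threshold classifiers; the data, the hypothesis class, and the 0-1 loss are all illustrative choices, not anything prescribed above.

```python
# Minimal ERM sketch: exhaustively minimize the empirical 0-1 risk over a
# (hypothetical) finite class of threshold classifiers h_t(x) = 1[x >= t].
import numpy as np

rng = np.random.default_rng(0)

# Toy training data: 1-d features, noisy binary labels.
x = rng.normal(size=100)
y = (x + 0.3 * rng.normal(size=100) >= 0).astype(int)

# Finite hypothesis class, indexed by the threshold t.
thresholds = np.linspace(-2, 2, 41)

def empirical_risk(t):
    """Empirical 0-1 risk of the threshold classifier h_t on the training set."""
    preds = (x >= t).astype(int)
    return np.mean(preds != y)

# ERM: pick the hypothesis with the smallest empirical risk.
risks = np.array([empirical_risk(t) for t in thresholds])
t_hat = thresholds[np.argmin(risks)]
print(f"ERM threshold: {t_hat:.2f}, empirical risk: {risks.min():.3f}")
```

A finite class keeps the minimization an exhaustive search; for richer classes the same objective is typically minimized by numerical optimization, often over a surrogate loss.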
You can prove bounds on the performance of ERM compared to the best classifier in $\mathcal{H}$ via PAC learning or PAC-Bayes bounds, though the former is more common. Note that because $\hat{f}$ is data-dependent, one cannot apply the usual concentration inequalities directly to argue about its risk $R(\hat{f})$. One needs different machinery, e.g. uniform convergence bounds.
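As a standard example of such machinery (constants as in the usual textbook statement): for a finite class $\mathcal{H}$ and a loss bounded in $[0, 1]$, Hoeffding's inequality plus a union bound gives

$$\Pr\left( \sup_{f \in \mathcal{H}} \bigl| R(f) - \hat{R}(f) \bigr| > \epsilon \right) \le 2 |\mathcal{H}| \, e^{-2 n \epsilon^2},$$

so with probability at least $1 - \delta$, taking $\epsilon = \sqrt{\tfrac{\log(2|\mathcal{H}|/\delta)}{2n}}$,

$$R(\hat{f}) \le \hat{R}(\hat{f}) + \epsilon \le \min_{f \in \mathcal{H}} \hat{R}(f) + \epsilon \le \min_{f \in \mathcal{H}} R(f) + 2\epsilon.$$

Because the deviation bound holds simultaneously over all of $\mathcal{H}$, it covers the data-dependent $\hat{f}$, which a fixed-hypothesis concentration bound would not.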