You’re doing some statistical inference on a Friday night, as one does. You compute some estimate of the true parameter . But how do you know if it’s any good?

You could look at whether it’s consistent, or whether it’s unbiased, or whether , obeys a CLT around the true parameter. But who’s to say? The point of statistical decision theory is to provide a formal framework for answering this question.

One begins with a loss function which measures how good the estimator is. (eg is the ol’ classic squared error. There’s also error, loss, zero-one loss, etc.) The risk of an estimator the expected loss, i.e.,

Note that is fixed; the expectation is with respect to the randomness of the data.

It’s tempting to compare various estimators, , by comparing their risks. The one with the lower risk wins, right? But might have lower risk than on some parameters and higher risk on others (in fact, this is almost certainly the case for any reasonable estimators). In that case how do we say which one is better?

The two most common strategies are to consider the maximum risk and the Bayesian risk.

  • The maximum risk considers the worst case over all parameters , . The estimator minimizing the maximum risk is the minimax estimator. A huge swath of modern statistics is taken up with the question of determining minimax rates and finding minimax estimators.
  • The Bayesian risk puts a prior over and then considers the Bayes risk: . There’s lots more to say here, see Bayesian decision theory.