Statistical Decision Theory

You’re doing some statistical inference on a Friday night, as one does. You compute some estimate $θ$ of the true parameter $θ$ . But how do you know if it’s any good?

You could look at whether it’s consistent, or whether it’s unbiased, or whether $θ - θ$ , obeys a CLT around the true parameter. But who’s to say? The point of statistical decision theory is to provide a formal framework for answering this question.

One begins with a loss function $L : Θ \times Θ \to R_{\geq 0}$ which measures how good the estimator is. (eg $L (θ, ϕ) = (θ - ϕ)^{2}$ is the ol’ classic squared error. There’s also $ℓ_{1}$ error, $L_{p}$ loss, zero-one loss, etc.) The risk of an estimator $θ (X)$ the expected loss, i.e.,

R (θ, θ) = E_{X \sim P_{θ}} R (θ, θ (X)) .

Note that $θ$ is fixed; the expectation is with respect to the randomness of the data.

It’s tempting to compare various estimators, $θ_{1}$ , $θ_{2}$ by comparing their risks. The one with the lower risk wins, right? But $θ_{1}$ might have lower risk than $θ_{2}$ on some parameters $θ$ and higher risk on others (in fact, this is almost certainly the case for any reasonable estimators). In that case how do we say which one is better?

The two most common strategies are to consider the maximum risk and the Bayesian risk.

The maximum risk considers the worst case over all parameters $θ$ , $M (θ) = sup_{θ} R (θ, θ)$ . The estimator minimizing the maximum risk is the minimax estimator. A huge swath of modern statistics is taken up with the question of determining minimax rates and finding minimax estimators.
The Bayesian risk puts a prior $π$ over $Θ$ and then considers the Bayes risk: $B (θ) = \int R (θ, θ) π (θ) d θ$ . There’s lots more to say here, see Bayesian decision theory.

The Stats Map

Explore

statistical decision theory

Backlinks

Graph View

Recently Updated

regret minimization

infinitely divisible distribution

best-arm identification

characteristic function

Bayes factors

adjusters

multi-armed bandits

Explore