The Neyman-Pearson paradigm is formulated in terms of Type I and Type II errors. Suppose we are testing the null $H_0$ vs the alternative $H_1$ (see hypothesis testing). In particular, we focus on constructing hypothesis tests $\phi$ such that
$$\Pr_{H_0}(\phi(X) = 1) \le \alpha$$
for some pre-specified $\alpha \in (0, 1)$. Here $X$ are the observations and $\phi(X) = 1$ denotes rejecting the null. (Note the pre-specification is important; see issues with p-values. This motivates post-hoc hypothesis testing.) Subject to this constraint we want to maximize the power of the test, i.e.,
$$\Pr_{H_1}(\phi(X) = 1).$$
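As a concrete sketch of the paradigm (the one-sided Gaussian location test, effect size, and sample size below are illustrative choices, not part of the general setup), we can estimate both the type-I error and the power of a test by simulation:

```python
import math
import random

random.seed(0)

# One-sided z-test for H0: mu = 0 vs H1: mu > 0 with known sigma = 1.
# phi returns 1 (reject) when the standardized sample mean exceeds the
# upper-alpha quantile of N(0, 1); alpha = 0.05 is hard-coded here.
Z_ALPHA = 1.6448536269514722  # Phi^{-1}(0.95)

def phi(x):
    z = sum(x) / math.sqrt(len(x))  # distributed N(0, 1) under H0
    return 1 if z > Z_ALPHA else 0

def rejection_rate(mu, n=25, trials=20000):
    # Monte Carlo estimate of Pr_mu(phi(X) = 1)
    return sum(phi([random.gauss(mu, 1) for _ in range(n)])
               for _ in range(trials)) / trials

type_i_error = rejection_rate(mu=0.0)  # hovers near alpha = 0.05
power = rejection_rate(mu=0.5)         # Pr_{H1}(reject) at effect size 0.5
```

With $n = 25$ and effect size $0.5$ the exact power is $\Phi(0.5\sqrt{25} - 1.645) \approx 0.80$, and the simulation should land close to that.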
Wald was the first (I think) to formulate NP in terms of decision theory. Introduce a loss function with two parameters, $\ell(\theta, a)$, where $\theta \in \{0, 1\}$ represents the null and alternative hypothesis, and $a \in \{0, 1\}$ represents the action (accept or reject). Presumably one has $\ell(0, 0) = \ell(1, 1) = 0$, i.e., correct decisions incur no loss, and $\ell(0, 1), \ell(1, 0) > 0$.
We restate the type-I error guarantee as a Type-I risk guarantee:
$$\mathbb{E}_{H_0}[\ell(0, \phi(X))] \le \alpha.$$
Note that
$$\mathbb{E}_{H_0}[\ell(0, \phi(X))] = \ell(0, 1)\,\Pr_{H_0}(\phi(X) = 1),$$
so in order to recapture the type-I error guarantee, we can take $\ell(0, 1) = 1$, in which case
$$\mathbb{E}_{H_0}[\ell(0, \phi(X))] = \Pr_{H_0}(\phi(X) = 1) \le \alpha.$$
To recover the notion of power, introduce the type-II risk, which is $\mathbb{E}_{H_1}[\ell(1, \phi(X))]$. Write
$$\mathbb{E}_{H_1}[\ell(1, \phi(X))] = \ell(1, 0)\,\Pr_{H_1}(\phi(X) = 0) = \ell(1, 0)\,(1 - \mathrm{power}),$$
which relates type-II risk to power. We want to minimize the type-II risk.
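The two risk identities above can be checked numerically. In this sketch the specific loss values, sample size, and effect size are illustrative; the z-test is the same one-sided Gaussian test as in the standard NP setup, with rejection probabilities computed from the normal CDF rather than by simulation:

```python
import math

def Phi(z):
    # standard normal CDF
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

# loss(theta, a): zero for correct decisions; loss(0, 1) = 1 as in the text,
# loss(1, 0) = 2 is an arbitrary illustrative choice.
loss = {(0, 0): 0.0, (0, 1): 1.0, (1, 0): 2.0, (1, 1): 0.0}

n, mu1, z_alpha = 25, 0.5, 1.6448536269514722
reject_prob = {  # Pr(phi(X) = 1) under each hypothesis
    0: 1 - Phi(z_alpha),                       # type-I error = alpha
    1: 1 - Phi(z_alpha - mu1 * math.sqrt(n)),  # power
}

def risk(theta):
    # E_theta[loss(theta, phi(X))], summing over the two actions
    p = reject_prob[theta]
    return loss[(theta, 0)] * (1 - p) + loss[(theta, 1)] * p

type_i_risk = risk(0)   # equals the type-I error, since loss(0, 1) = 1
type_ii_risk = risk(1)  # equals loss(1, 0) * (1 - power)
```

Here `type_i_risk` comes out to exactly $\alpha = 0.05$, and `type_ii_risk` equals $\ell(1,0)(1 - \mathrm{power})$ as the display above predicts.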
Using losses allows us to generalize the NP paradigm beyond binary decisions (accept/reject) and to consider more general decision spaces. E.g. we can consider $\ell(\theta, a)$ for $a$ ranging over a general action space $\mathcal{A}$ rather than $\{0, 1\}$. This enables post-hoc hypothesis testing, as Grunwald studies.
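One instance of a richer action space is the e-value, central to Grunwald's post-hoc testing work: instead of a binary decision, the test outputs a nonnegative number $e(X)$ with $\mathbb{E}_{H_0}[e(X)] \le 1$. The Gaussian likelihood-ratio construction below is my illustration of the idea, not Grunwald's exact setup; any likelihood ratio $dP_1/dP_0$ has null expectation exactly 1 and so qualifies:

```python
import math
import random

random.seed(1)

MU1, N = 0.5, 5  # illustrative alternative mean and sample size

def e_value(x, mu1=MU1):
    # Likelihood ratio of N(mu1, 1) vs N(0, 1) on an i.i.d. sample:
    # exp(sum_i (mu1 * x_i - mu1**2 / 2)).  Under H0 its expectation
    # is exactly 1, making it a valid e-value.
    return math.exp(sum(mu1 * xi - mu1 ** 2 / 2 for xi in x))

# Monte Carlo check that E_{H0}[e(X)] is close to 1
samples = [e_value([random.gauss(0, 1) for _ in range(N)])
           for _ in range(50000)]
mean_e = sum(samples) / len(samples)
```

By Markov's inequality, rejecting when $e(X) \ge 1/\alpha$ then controls the type-I error at level $\alpha$, which is how e-values reconnect to the NP guarantee.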
References
- Contributions to the theory of statistical estimation and testing hypotheses, Wald 1939.
- Beyond Neyman-Pearson, Grunwald 2024.