Proper Scoring Rule

A scoring rule is a function used to evaluate forecasts. Let $X$ be an outcome space of interest, and $Δ (X)$ the set of distributions over $X$ . The goal of a forecaster is to produce some $P \in Δ (X)$ which is “good”, in some sense.

To measure whether $P$ is good, we introduce a scoring rule $S : Δ (X) \times X \to R$ , which takes in the forecaster’s distribution $P$ and an outcome $x \in X$ and says how good $P$ was. Usually $S$ is taken to be “positively oriented,” meaning larger values of $S$ imply better forecasts.

Given $S$ , the expected score between $P, Q \in Δ (X)$ is

S (P ∥ Q) := E_{x \sim Q} S (P, x) .

A scoring rule $S$ is proper if for all $Q \in Δ (X)$ ,

Q \in argmax_{P \in Δ (X)} S (P ∥ Q) .

In words: Suppose you as the forecaster knew that the “true” distribution was $Q$ . If $S$ is a proper scoring rule, it means that you can play $Q$ to maximize your score. This is so intuitive it’s almost painful.

Examples of proper scoring rules for binary forecasts are (note that $P = p \in [0, 1]$ in this case):

Brier score: $S (p, x) = 1 - (p - x)^{2}$
Spherical score: $S (p, x) = \frac{p y + ( 1 - p ) ( 1 - x )}{p ^{2} + ( 1 - p ) ^{2}}$
Logarithmic score: $S (p, x) = x lo g (p) + (1 - x) lo g (1 - p)$
0-1 score: $S (p, x) = x 1 {p \geq 0.5} + (1 - x) in d {p \leq 0.5}$ .

The Stats Map

Explore

proper scoring rule

Graph View

Backlinks

Explore