A scoring rule is a function used to evaluate forecasts. Let be an outcome space of interest, and the set of distributions over . The goal of a forecaster is to produce some which is “good”, in some sense.
To measure whether is good, we introduce a scoring rule , which takes in the forecaster’s distribution and an outcome and says how good was. Usually is taken to be “positively oriented,” meaning larger values of imply better forecasts.
Given , the expected score between is
A scoring rule is proper if for all ,
In words: Suppose you as the forecaster knew that the “true” distribution was . If is a proper scoring rule, it means that you can play to maximize your score. This is so intuitive it’s almost painful.
Examples of proper scoring rules for binary forecasts are (note that in this case):
- Brier score:
- Spherical score:
- Logarithmic score:
- 0-1 score: .