Specific protocols of game-theoretic probability can be used to test hypotheses. This yields a fundamental tool in game-theoretic statistics.
Suppose we receive data and we are testing the hypotheses (sequential hypothesis testing):
This is a general framework which encapsulates both point nulls (eg ) and composite nulls (eg for some set ). and can be infinite sets of distributions, singletons, finite sets, etc.
The game proceeds as follows. Skeptic begins with wealth . For :
- Skeptic proposes a payoff function such that
- Reality reveals an observation .
- Skeptic updates her wealth as
As usual in game-theoretic statistics, this implies that is an e-process for . Under the null, the skeptic’s wealth is therefore not expected to grow. Indeed, by Ville’s inequality,
Therefore, if we reject the null when the exceeds , we have only an probability of making an error, i.e., our type I error uniformly over time is bounded by . In other words, this is a level- sequential test. (One can also use the randomized Ville’s inequality to test at a stopping time).
While this is framed as one-sample testing, we can also consider testing by betting—two-sample testing.
Payoff functions
Clearly, it is important to find good payoff functions. This constitutes a large area of study. While is designed to be a supermartingale under the null, we want it to grow as large as possible as quickly as possible under the alternative so that the null is rejected quickly.
Typically when designing betting strategies, we follow the approach of maximizing log-wealth, i.e., we want to grow fast under the alternative. Though see growth rate conditions in sequential testing for a more general overview.
The payoff is admissible if a.s. under implies that . If is admissible, then (as opposed to an inequality). This means the wealth process is a martingale (as opposed to a supermartingale). (This is easy to check: Set , which satisfies ).
So how do we actually choose ? This choice depends on the problem at hand, whether it’s simple null vs simple alternative (testing by betting—simple vs simple), simple null vs composite alternative (testing by betting—simple vs composite), or composite null vs composite alternative (testing by betting—composite vs composite, in which case we use universal inference or the reverse information projection (RIPr).