hypothesis-testing sequential-statistics

One solution to the composite-vs-composite testing problem discussed in testing by betting—composite vs composite.

In sequential hypothesis testing, suppose we are testing a composite null. Universal inference (UI), introduced by Wasserman, Ramdas, and Balakrishnan (2020), is one way to construct e-values and e-processes in this setting.

Let $X_1, X_2, \dots$ be a stream of observations from some distribution $P$. We are testing the composite null $H_0: P \in \mathcal{P}_0$ vs a composite alternative $H_1: P \in \mathcal{P}_1$ (though of course the null could also be simple).

Split UI

Consider the fixed-time setting, where we have observations $X_1, \dots, X_n$. The most basic UI procedure is split UI. The general split UI procedure is the following:

  • split the data into $D_0$ and $D_1$.
  • pick some alternative density $\hat p_1$ using $D_1$ only.
  • compute the e-value using $D_0$.

The UI e-value is

$$E_n = \frac{\prod_{i \in D_0} \hat p_1(X_i)}{\sup_{p \in \mathcal{P}_0} \prod_{i \in D_0} p(X_i)}.$$
If the null MLE is well-defined, then the supremum in the denominator is attained at it. Since we're taking a supremum over the whole null, for any $P \in \mathcal{P}_0$ with density $p$,

$$\mathbb{E}_P[E_n] \le \mathbb{E}_P\left[\prod_{i \in D_0} \frac{\hat p_1(X_i)}{p(X_i)}\right] = \mathbb{E}_P\left[\mathbb{E}_P\left[\prod_{i \in D_0} \frac{\hat p_1(X_i)}{p(X_i)} \,\middle|\, D_1\right]\right] = 1,$$

where we use that $\hat p_1$ is independent of $D_0$ due to the data-splitting.
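As a concrete sketch (not from the original note), here is split UI in a toy model where the null is $N(\mu, 1)$ with $\mu \le 0$; the model, the even split, and the choice of $\hat p_1$ as a Gaussian centered at the $D_1$ sample mean are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def gauss_loglik(x, mu):
    # N(mu, 1) log-likelihood; the -0.5*log(2*pi) terms cancel between
    # numerator and denominator of the e-value, so they are dropped
    return -0.5 * float(np.sum((x - mu) ** 2))

def split_ui_evalue(x, rng):
    # 1. split the data into D0 and D1
    idx = rng.permutation(len(x))
    d1, d0 = x[idx[: len(x) // 2]], x[idx[len(x) // 2:]]
    # 2. pick an alternative density using D1 only (plug in the D1 sample mean)
    mu1 = d1.mean()
    # 3. null MLE on D0: the argmax of the likelihood over mu <= 0 is min(mean, 0)
    mu0 = min(d0.mean(), 0.0)
    return np.exp(gauss_loglik(d0, mu1) - gauss_loglik(d0, mu0))

e_null = split_ui_evalue(rng.normal(0.0, 1.0, 200), rng)  # valid: E[e] <= 1
e_alt = split_ui_evalue(rng.normal(0.5, 1.0, 200), rng)   # tends to be large
print(e_null, e_alt)
```

Rejecting when the e-value exceeds $1/\alpha$ gives a level-$\alpha$ test by Markov's inequality.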

Note that no regularity conditions were required. This approach can therefore be applied to irregular testing problems, such as testing the number of components in a Gaussian mixture, where classical likelihood-ratio asymptotics break down. This is why the method is considered universal.

$E_n$ is an e-value for any choice of $\hat p_1$. But the choice of $\hat p_1$ matters for the power of the test. As in testing by betting—simple vs composite, $\hat p_1$ can be chosen via the method of mixtures or the plug-in method.

The randomness introduced by the split can be reduced by averaging the e-values obtained over several splits of the data, not just one; an average of e-values is again an e-value.
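A minimal sketch of this averaging, under the same toy Gaussian null $N(\mu, 1)$, $\mu \le 0$ (the model, the number of splits, and the function names are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)

def split_ui_evalue(x, rng):
    # one random split; H0: N(mu, 1) with mu <= 0 (toy model)
    idx = rng.permutation(len(x))
    d1, d0 = x[idx[: len(x) // 2]], x[idx[len(x) // 2:]]
    mu1 = d1.mean()            # alternative fitted on D1
    mu0 = min(d0.mean(), 0.0)  # null MLE on D0
    return np.exp(-0.5 * np.sum((d0 - mu1) ** 2) + 0.5 * np.sum((d0 - mu0) ** 2))

def averaged_ui_evalue(x, rng, k=20):
    # averaging over k splits preserves validity: under the null, the
    # expectation of a mean of e-values is still at most 1
    return float(np.mean([split_ui_evalue(x, rng) for _ in range(k)]))

e_avg = averaged_ui_evalue(rng.normal(0.3, 1.0, 200), rng)
print(e_avg)
```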

UI e-process

The above e-value doesn’t immediately lend itself to sequentialization. Here’s how to construct an e-process using similar ideas.

Let $\hat p_{i-1}$ be any distribution chosen using only the first $i-1$ observations $X_1, \dots, X_{i-1}$. Consider

$$M_t = \frac{\prod_{i=1}^{t} \hat p_{i-1}(X_i)}{\sup_{p \in \mathcal{P}_0} \prod_{i=1}^{t} p(X_i)},$$

where the supremum in the denominator is attained at the MLE based on $X_1, \dots, X_t$ when it exists. This is an e-process under $\mathcal{P}_0$: replacing the MLE in the denominator by the true null density $p$ can only increase the ratio, and the resulting upper bound $\prod_{i=1}^{t} \hat p_{i-1}(X_i)/p(X_i)$ is a nonnegative martingale with initial value 1 under any $P \in \mathcal{P}_0$. This is the plug-in method. We can also consider a mixture method, where we integrate the alternative densities over some distribution. Similar ideas occur in testing by betting—composite vs composite and also when constructing confidence sequences: confidence sequences via conjugate mixtures (mixture method) and confidence sequences via predictable plug-ins (plug-in method).
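A sketch of the plug-in UI e-process in the same toy model (null $N(\mu, 1)$ with $\mu \le 0$); taking $\hat p_{i-1}$ to be a Gaussian centered at the running mean is an illustrative assumption, not part of the note:

```python
import numpy as np

def ui_eprocess(x):
    # M_t = prod_{i<=t} p_hat_{i-1}(X_i) / sup_{null} prod_{i<=t} p(X_i),
    # for H0: N(mu, 1) with mu <= 0 (toy model)
    log_num, run_sum, log_m = 0.0, 0.0, []
    for t, xt in enumerate(x, start=1):
        # predictable plug-in: uses only the first t-1 observations
        mu_prev = run_sum / (t - 1) if t > 1 else 0.0
        log_num += -0.5 * (xt - mu_prev) ** 2
        run_sum += xt
        # null MLE on X_1..X_t: argmax over mu <= 0 is min(mean, 0)
        mu0 = min(run_sum / t, 0.0)
        log_den = -0.5 * float(np.sum((x[:t] - mu0) ** 2))
        log_m.append(log_num - log_den)
    return np.exp(log_m)  # M_1, ..., M_n

rng = np.random.default_rng(0)
m = ui_eprocess(rng.normal(0.7, 1.0, 300))
# reject at level alpha the first time M_t >= 1/alpha; by Ville's inequality
# this ever happens with probability at most alpha under the null
print(m[-1])
```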

Tartakovsky actually wrote down this e-process in the 2014 book Sequential Analysis: Hypothesis Testing and Changepoint Detection (with Nikiforov and Basseville).