Let $(S_{t})$ be a martingale wrt to the filtration $(F_{t})$ . Assume $(S_{t})$ is scalar-valued unless otherwise indicated. Here we investigate concentration inequalities for $(S_{t})$ .

Note that martingale concentration inequalities generalize concentration inequalities for independent random variables (eg bounded scalar concentration), since we may take $S_{t} = \sum_{i \leq t} (X_{i} - μ)$ , in which the following bounds translate into bounds on $\sum_{i \leq t} X_{i}$ .

While we state concrete, mostly fixed-time results here, we note that many of the following bounds were made time-uniform (and often tightened) using sub-psi processes.

Azuma-Hoeffding inequality

Assume that $∣ X_{t} - X_{t - 1} ∣ \leq c_{t}$ for all $t$ , i.e., the martingale has bounded increments. Then, for all $n$ ,

P (∣ X_{n} - X_{0} ∣ \geq ϵ) \leq 2 exp (\frac{- ϵ ^{2}}{2 \sum _{t = 1}^{n} c _{t}^{2}}) .

The natural one-sided versions of this inequality also exist. Note that $n$ is fixed in advance here (i.e., it is fixed-time result).

Dubins-Savage inequality

This is often considered Chebyshev’s inequality for martingales. If $(X_{t})$ has conditional means $μ_{t}$ , i.e., $E [X_{t} ∣ F_{t - 1}] = μ_{t}$ and conditional variances $V_{t} = V (X_{t} ∣ F_{t - 1})$ then for any $a, b > 0$ ,

P (\exists t \geq 1 : i \leq t \sum (X_{i} - μ_{i}) \geq a + b i \leq t \sum V_{i}) \leq \frac{1}{ab + 1} .

This is a time-uniform result. This result can also be generalized to infinite variance. If $v_{t} (p) = E [∣ X_{t} - μ_{t} ∣^{p} ∣ F_{t - 1}]$ for $1 < p \leq 2$ , then

P (\exists t \geq 1 : i \leq t \sum (X_{i} - μ_{i}) \geq a + b i \leq t \sum v_{i} (p)) \leq \frac{1}{( c _{p} a b ^{\frac{1}{p - 1}} + 1 ) ^{p - 1}},

where $c_{p}$ is a constant dependent on $c$ . This was proven by Kahn in 2009.

Variance bound

If the martingale has bounded increments and the variance of the increments are also bounded, i.e.,

E [∣ X_{t} - X_{t - 1} ∣^{2} ∣ F_{t - 1}] \leq v_{t}^{2},

then we can modify Azuma’s bound to read

P (∣ X_{n} - X_{0} ∣ \geq ϵ) \leq 2 exp (\frac{- ϵ ^{2}}{4 V}),

where $V = \sum_{i} v_{i}^{2}$ , as long as $ϵ \leq 2 V / max_{i} c_{i}$ . Why is this better than Azuma’s inequality? Since the increments are bounded by $c_{t}$ , a trivial bound on $E [∣ X_{t} - X_{t - 1} ∣^{2} ∣ F_{t - 1}]$ is $c_{t}^{2}$ . Thus we may assume that $v_{t}^{2} \leq c_{t}^{2}$ , which means the right hand side of the bound is tighter.

This was first proved by DA Grable in A Large Deviation Inequality for Functions of Independent, Multi-way Choices. A modern proof is given by Dubhasi and Panconesi in their textbook, Concentration of Measure for the Analysis of Randomized Algorithms, Chapter 8.

Bentkus-style inequality

Let $(X_{i}$ ) be a supermartingale adapted to $(F_{i})$ . If $a_{i} \leq X_{i} \leq b_{i}$ , then

P (S_{n} \geq u) \leq α \geq 1 in f t in f \frac{E ( S _{n} - t ) _{+}^{α}}{( u - t ) _{+}^{α}} .

Similarly to Bentkus’ inequality for scalar random variables, this improves over the Chernoff method. We can further bound this as

α \geq 1 in f t in f \frac{E ( S _{n} - t ) _{+}^{α}}{( u - t ) _{+}^{α}} \leq α \geq 1 in f t in f \frac{E ( \sum _{i} G _{i} - t ) _{+}^{α}}{( u - t ) _{+}^{α}} \leq t in f \frac{E ( \sum _{i} G _{i} - t ) _{+}}{( u - t ) _{+}} .

where $G_{i} \in {a_{i}, b_{i}}$ and $E G_{i} = 0$ . The right hand side can be computed explicitly, or approximated, in some circumstances. See here.

Bentkus-style inequality for variance

In addition to the boundedness condition above, suppose we also have $V (X_{i}) \leq σ_{i}^{2}$ . Then we can write

P (S_{n} \geq u) \leq α \geq 2 in f t in f \frac{E ( S _{n} - t ) _{+}^{α}}{( u - t ) _{+}^{α}} .

As above, we can bound the right hand side with a worst case distribution. In particular, this time we have $G_{i} \in {- σ_{i}^{2} / b_{i}, b_{i}}$ and $E G_{i} = 0$ .

References

Concentration of Measure for the Analysis of Randomized Algorithms by Dubhashi and Panconesi.

The Stats Map

Explore

martingale concentration