Cdf Concentration

What can we say about the concentration of the empirical CDF about the true CDF?

Scalar DKW inequality

For real valued observations $X_{1}, \dots, X_{n}$ with $F_{n} (x) = \frac{1}{n} \sum_{i \leq n} 1 (X_{i} \leq x)$ , the DKW (Dvoretzky-Kiefer-Wolfowitz) inequality states that

P (x \in R sup ∣ F_{n} (x) - F (x) > t) \leq 2 exp (- 2 n t^{2}) .

Note this statement is uniform in $x$ which, depending on your expectations, is kind of remarkable. The intuition is that if $F (x)$ sharply increases (it can’t decrease, of course) at some $x$ , then these are precisely the places where we expect to see many observations, so $F_{n} (x)$ won’t be too far from $F (x)$ .

The original DKW inequality was an asymptotic statement: it bounded the probability as $≲ exp (- 2 n t^{2})$ . In 1990, Paul Massart proved that the constant on the right hand side was 2, and he proved that this was tight.

Multivariate DKW inequality

In 2021, Naaman gave a multivariate extension of DKW. Namely,

P (x \in R^{d} sup ∣ F_{n} (x) - F (x) > t) \leq d (n + 1) exp (- 2 n t^{2}) .

For sufficiently large $n$ , $n + 1$ can be replaced by $2$ in which case the constant in front of the exponential is $2 d$ , which is optimal. Here the empirical CDF is as above, but $1 (X_{i} \leq x$ ) is interpreted component-wise: to be true each component of $X_{i}$ must be at most $x_{i}$ .

In 1977, Devroye gave a similar result but with $2 e^{2} (2 n)^{d}$ replacing $d (n + 1)$ .

Both the univariate and multivariate DKW inequalities hold under slightly weaker assumptions than the usual iid assumption.

Glivenko-Cantelli theorem

Glivenko and Cantelli proved that, in the scalar case,

x \in R sup ∣ F_{n} (x) - F (x) ∣ = ∥ F_{n} - F ∥_{\infty} a . s . 0.

The DKW inequality implies almost sure convergence (via the Borel-Cantelli) lemma, so the Glivenko-Cantelli theorem is strictly weaker than the DKW inequality, which provides explicit rates of convergence. But the Glivenko-Cantelli theorem is part of a broader conversation about Glivenko-Cantelli classes in empirical process theory.

The Stats Map

Explore

cdf concentration

Scalar DKW inequality

Multivariate DKW inequality

Glivenko-Cantelli theorem

Table of Contents

Graph View

Backlinks

Explore