Scalar Heavy-Tailed Mean Estimation

Unlike light-tailed settings (light-tailed, unbounded scalar concentration and bounded scalar concentration) the sample mean is not well-behaved in heavy-tailed settings. Since heavy-tailed distributions may not have finite MGFs, the Chernoff method is not applicable. Catoni gives an example demonstrating the bound achieved via Markov’s inequality (basic inequalities), i.e.,

P (∣ \overline{X}_{n} - μ ∣) \geq \frac{σ}{n δ}) \leq δ,

is essentially tight (where we receive observations $X_{1}, \dots, X_{n}$ and $\overline{X}_{n}$ is the sample mean). The issue is that outliers can have devastating effects on the sample mean, and heavy-tailed distributions can have many extreme observations. See this discussion by Lugosi and Mendelson for more details, or Sub-Gaussian mean estimators by Devroye et al.

Ideally we want estimators with sub-Gaussian like behavior, i.e.,

P (∣ μ - μ ∣ ≳ σ \frac{lo g ( 1/ δ )}{n}) \leq δ .

This is an exponential improvement in the dependence on $1/ δ$ . These are called sub-Gaussian estimators.

There are several approaches to heavy-tailed mean estimation in scalar settings:

The Stats Map

Explore

scalar heavy-tailed mean estimation

Graph View

Backlinks

Explore