Functions which capture “distances” between probability distributions. They may not be metrics in the formal sense (metric space); e.g. some are not symmetric, as with the KL divergence.
Examples (a few of these are computed numerically in the sketch after the list):
- KL divergence
- Wasserstein distance
- KS distance
- total variation distance
- Hellinger distance
- chi-squared divergence
- f-divergence (encapsulates several of the above)
- Fisher information distance
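A minimal Python sketch of a few of these, using NumPy/SciPy; the particular distributions `p`, `q`, `x`, `y` are arbitrary choices for illustration, not anything canonical.

```python
import numpy as np
from scipy.stats import entropy, ks_2samp, wasserstein_distance

rng = np.random.default_rng(0)

# Two arbitrary discrete distributions on the same 4-point support.
p = np.array([0.1, 0.2, 0.3, 0.4])
q = np.array([0.25, 0.25, 0.25, 0.25])

kl = entropy(p, q)                       # KL divergence D(p || q), in nats; note it's asymmetric
tv = 0.5 * np.abs(p - q).sum()           # total variation distance
hellinger = np.sqrt(0.5 * ((np.sqrt(p) - np.sqrt(q)) ** 2).sum())
chi2 = ((p - q) ** 2 / q).sum()          # chi-squared divergence

# Sample-based distances between two arbitrary continuous distributions.
x = rng.normal(0.0, 1.0, size=10_000)
y = rng.normal(0.5, 1.2, size=10_000)
w1 = wasserstein_distance(x, y)          # empirical Wasserstein-1 distance
ks = ks_2samp(x, y).statistic            # Kolmogorov-Smirnov statistic

print(f"KL={kl:.4f}  TV={tv:.4f}  Hellinger={hellinger:.4f}  chi2={chi2:.4f}")
print(f"W1={w1:.4f}  KS={ks:.4f}")
```

(SciPy's `entropy(p, q)` returns the asymmetric KL divergence with natural logs; conventions on base and normalization vary across references.)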
Some definitions
Different authors use different notation: some write distances as functions of the distributions themselves, e.g. $d(P, Q)$, and some write them as functions of the corresponding random variables, $d(X, Y)$.
A metric $d$ is regular if $d(X + Z, Y + Z) \le d(X, Y)$ for any $Z$ independent of $X$ and $Y$. This captures the notion that blurring observations by independent noise makes them harder to distinguish, i.e., does not increase the distance between them.
Regularity is equivalent to sub-additivity: $d(X_1 + X_2, Y_1 + Y_2) \le d(X_1, Y_1) + d(X_2, Y_2)$ for $X_1, X_2$ independent and $Y_1, Y_2$ independent. (Taking $X_2 = Y_2 = Z$ recovers regularity, since $d(Z, Z) = 0$.)
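As a rough numerical sanity check of regularity (a sketch assuming the Wasserstein-1 distance, which is regular, with arbitrary Gaussian/Laplace examples): blurring both samples with the same independent noise should not increase their empirical distance, up to Monte Carlo error.

```python
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(1)
n = 100_000

x = rng.normal(0.0, 1.0, size=n)    # samples from X
y = rng.normal(0.0, 2.0, size=n)    # samples from Y
z = rng.laplace(0.0, 3.0, size=n)   # heavy "blurring" noise Z, independent of X and Y

d_xy = wasserstein_distance(x, y)             # estimate of d(X, Y)
d_blur = wasserstein_distance(x + z, y + z)   # estimate of d(X + Z, Y + Z)

# Regularity predicts d(X + Z, Y + Z) <= d(X, Y); here the blurred distance
# should come out clearly smaller (up to sampling error).
print(f"d(X, Y)     = {d_xy:.3f}")
print(f"d(X+Z, Y+Z) = {d_blur:.3f}")
```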
A metric $d$ is homogeneous of order $s$ if $d(cX, cY) = |c|^s \, d(X, Y)$ for all $c \neq 0$.
Ideal metrics of order $s$ are simultaneously regular and homogeneous of order $s$. These come up in the study of central limit theorems (see quantitative CLT template with ideal metrics).
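In the same hedged spirit: the Wasserstein-1 (Kantorovich) distance is a standard example of an ideal metric of order 1, and its order-1 homogeneity can be eyeballed numerically by rescaling both samples (again with arbitrary example distributions and an arbitrary constant `c`).

```python
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(2)
x = rng.normal(0.0, 1.0, size=100_000)
y = rng.exponential(1.0, size=100_000)

c = 3.7  # arbitrary nonzero scaling constant
d = wasserstein_distance(x, y)
d_scaled = wasserstein_distance(c * x, c * y)

# Homogeneity of order s = 1: d(cX, cY) = |c| * d(X, Y), up to sampling error.
print(f"d(cX, cY)     = {d_scaled:.4f}")
print(f"|c| * d(X, Y) = {abs(c) * d:.4f}")
```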