Let be some parameter space and be a family of functions from to . The integral probability metric (IPM) with respect to between two distributions and is the distributional distance

IPMs recover the total variation distance, the Wasserstein distance, the KS distance, and the kernel MMD under different choices of . is often taken to be symmetric so that we can get rid of the absolute value above.

IPMs and f-divergences intersect (non-trivially) only at the total variation distance. IPMs are convex and can thus be sequentially estimated by confidence sequences for convex functionals. See Sriperumbudur et al for more on plug-in estimation.