Monge Formulation

A (strict) formulation of the optimal transport problem. It’s always taught with dirt piles; why deviate from tradition.

Consider dirt piles at locations $D_{1}, \dots, D_{n}$ and holes at locations $H_{1}, \dots, H_{m}$ . Suppose pile $D_{i}$ has $α_{i}$ units of dirt and hole $H_{j}$ can hold $β_{j}$ units of dirt. It costs $c (i, j) \geq 0$ to transport a unit of dirt from $D_{i}$ to $H_{j}$ . Our question is: Which dirt pile do we send to which hole in order to minimize the overall cost?

More formally, we want a transportation plan $T : [n] \to [m]$ which maps piles to holes, telling us where to send the $i$ -th dirt pile. Each pile can only be sent to one hole, and at the end of the process, each hole should be full. That is, for each $j$ , we should have $\sum_{i} α_{i} δ_{T (i) = j} = β_{j}$ .

We are thus looking to solve the following problem:

T min {i \sum α_{i} c (i, T (i)) i \sum α_{i} δ_{T (i) = j} = β_{j}, \forall j \in [m]} .

This is the Monge formulation in the discrete case.

To see what this has to do with probability, suppose that $D_{1}, \dots, D_{n}$ and $H_{1}, \dots, H_{m}$ represent two discrete distributions, with mass $α_{i}$ at $D_{i}$ and mass $β_{j}$ at $H_{j}$ . That is, $α_{1}, \dots, α_{n}$ and $β_{1}, \dots, β_{m}$ constitute two discrete probability measures $α = P_{α}$ and $β = P_{β}$ . $T$ is a map between these two measures that obeys

β_{j} = P_{β} (j) = P_{α} (T^{- 1} (j)) = P_{α} ({i : T (i) = j}) .

That is, the pushforward measure $α_{*} (T)$ must be equivalent to the distribution $β$ . With this perspective in mind, we can also notice that $\sum_{i} α_{i} c (i, T (i))$ is simply the expected value of $c (X, t (X))$ when $X \sim α$ . This motivates the extension of the discrete Monge formulation to the continuous case:

T in f {E_{X \sim α} c (X, T (X)) α_{*} (T) = β} .

This is the (continuous) Monge formulation, and a map $T$ solving the above problem is sometimes called a Monge map (or a transportation plan). There’s an obvious problem with this formulation, however: What if there is no mapping between piles and holes such that each hole will be filled by each pile. That is, what if there is no map $T$ such that $α_{*} (T) = β$ . In fact, this is a very restrictive requirement in many applications. This is what motivates the Kantorovich relaxation and optimal transport costs.

The Stats Map

Explore

Monge formulation

Graph View

Backlinks

Explore