Here’s an intuitive explanation of Markov’s inequality. Suppose we fix a threshold $a > 0$ and the probability $P(X \ge a) = p$, and ask the question: how small can we make $\mathbb{E}[X]$ subject to the constraint that $X \ge 0$ a.s.?

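That is, we want to solve

$$\min_{X} \; \mathbb{E}[X] \quad \text{subject to} \quad P(X \ge a) = p, \quad X \ge 0 \ \text{a.s.}$$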

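This is easy. If we could, we’d put all of $X$’s mass at 0, which would minimize $\mathbb{E}[X]$ subject to $X \ge 0$. But we have to move at least some of the mass to $a$ or beyond. So we’ll put mass $p$ at $a$, and mass $1 - p$ at 0. It should be clear that any other alteration would simply increase $\mathbb{E}[X]$: pushing the upper mass beyond $a$ or lifting the lower mass off 0 can only raise the mean. With these choices, we have $\mathbb{E}[X] = a \cdot p + 0 \cdot (1 - p) = ap$. Since this minimizes $\mathbb{E}[X]$, every other nonnegative $X$ satisfies $\mathbb{E}[X] \ge a\,P(X \ge a)$, or equivalently $P(X \ge a) \le \mathbb{E}[X]/a$, which is precisely Markov’s inequality.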
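To make the argument concrete, here is a small numerical sketch, not a proof. It assumes illustrative values $a = 4$ and $p = 1/4$, and the helper names `mean` and `tail` are hypothetical, introduced only for this example. It compares the two-point distribution from the argument with two alternatives that keep the same tail probability.

```python
from fractions import Fraction


def mean(dist):
    """Expected value of a discrete distribution given as {value: probability}."""
    return sum(x * q for x, q in dist.items())


def tail(dist, a):
    """P(X >= a) for a discrete distribution given as {value: probability}."""
    return sum(q for x, q in dist.items() if x >= a)


# Illustrative values (assumptions, not taken from the text).
a = Fraction(4)
p = Fraction(1, 4)

# The extremal distribution from the argument: mass p at a, mass 1 - p at 0.
extremal = {Fraction(0): 1 - p, a: p}

# Two alternatives with the same tail probability P(X >= a) = p.
shifted_up = {Fraction(0): 1 - p, a + 2: p}  # upper mass pushed beyond a
lifted_low = {Fraction(1): 1 - p, a: p}      # lower mass lifted off 0

for dist in (extremal, shifted_up, lifted_low):
    assert sum(dist.values()) == 1   # valid probability distribution
    assert tail(dist, a) == p        # same tail probability
    assert mean(dist) >= a * p       # Markov: E[X] >= a * P(X >= a)

# Only the extremal distribution attains the bound with equality.
print(mean(extremal) == a * p)
print(mean(shifted_up), mean(lifted_low))
```

Exact rational arithmetic via `fractions.Fraction` avoids floating-point noise, so the equality check for the extremal case is exact.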