The general approach is as follows. Let be an -cover of (see covering and packing). Then

so

From here, is a finite set, so we can apply well-known maximal inequalities over finite sets. For instance, if is sub-Gaussian, then light-tailed maximal inequalities give

where we use an upper bound on the -covering number of . This gives

We can then optimize over to get a final bound. The sub-Gaussian case usually takes (e.g., Theorem 1.19 in high-dimensional statistics by Rigollet and Hutter). We can also get a high-probability bound by noticing that

which can be bounded with a union bound and a tail bound and then maybe a Chernoff bound (Chernoff method) based on the properties of .