Method for parametric density estimation. We are trying estimate an unknown -dimensional parameter (statistical inference). The method of moments is an old school method (though it’s recently seen a revival in interest, eg for estimating Gaussian mixtures), which equates the first theoretical moments with the first empirical moments. That is, we solve

(where is unknown). Note this is a purely frequentist approach. Because we’re only trying to solve the moment equations, the resulting parameter estimates can be outside of the parameter space . This doesn’t happen with the MLE or with Bayesian approaches, which both naturally respect .

The MoM is consistent under fairly weak assumptions, and it obeys a CLT. It’s not guaranteed to be unbiased. For exponential families, the MoM estimator coincides with the MLE.