Suppose we have a latent variable model

where are latent, and is the data. A core problem in Bayesian statistics is to compute the conditional distribution , i.e., to solve the problem: what are the hidden parameters given the data?

Writing,

we start to see the trouble. The integral on the rhs is usually intractable, since it involves integrating across all latent variables. So we typically approximate in some way. This is the problem of approximate inference. Noet that is usually called the “evidence”.

Common solutions to the problem include MCMC and variational inference.