A family of distributions form a -dimensional exponential family if their densities have the form

for some functions and of the parameter space and function of the data. Typically we parameterize the family such that we may just write .

The exponential family structure is preserved for iid samples: For samples, becomes , becomes and becomes .

Many distributions are exponential families; normals, binomials, Poisson, etc. See wikipedia for more.

The tuple is a sufficient statistic. is the factor which ensures the density integrates to 1. It’s called the log-partition function and can be written as

is also known as the cumulant function. It is a convex function of . Its derivatives generate moments as follows: