Mixture model
|
In mathematical statistics, the term mixture model has two different meanings.
One definition
A mixture model is a model in which the independent variables are measured as fractions of a total. For example, suppose researchers are trying to find the optimal mixture of ingredients for a fruit punch consisting of grape juice, mango juice, and pineapple juice. A mixture model is suitable here because the results of the taste tests will not depend on the amount of ingredients used to make the batch but rather on the fraction of each ingredient present in the punch. The sum of the mixture components is always 100%, and a mixture model takes this restriction into account.
Another definition
A mixture model can also be a formalism for modeling a probability density function as a sum of parameterized functions. In mathematical terms,
<math>p_{X}(x) = \sum_{k = 1}^{K} a_{k} h(x|\lambda _k)<math>
where <math>p_{X}(x)<math> is the modeled probability distribution function, <math>K<math> is the number of components in the mixture model, and <math>a_{k}<math> is mixture proportion of component <math>k<math>. By definition, <math>0 < a_{k} < 1<math> for all <math>k = 1 ... K <math> and <math>a_{1} + ... + a_{K} = 1<math>.
<math>h(x|\lambda _k)<math> is a probability distribution parameterized by <math>\lambda _k<math>.
Mixture models are often used when we know <math>h(x)<math> and we can sample from <math>p_{X}(x)<math>, but we would like to determine the <math>a_{k}<math> and <math>\lambda _k<math> values. Such situations can arise in studies in which we sample from a population that is composed of several distinct subpopulations.