Linear prediction
|
Linear prediction is a mathematical operation where future values of a digital signal are estimated as a linear function of previous samples.
In digital signal processing linear prediction is often called linear predictive coding (LPC) and can thus be viewed as a subset of filter theory. In system analysis (a subfield of mathematics), linear prediction can be viewed as a part of mathematical modelling or optimization.
The prediction model
The most common representation is
- <math>\hat{x}(n) = -\sum_{i=1}^p a_i x(n-i)<math>
where <math>\hat{x}(n)<math> is the predicted signal value, <math>x(n-i)<math> the previous observed values, and <math>a_i<math> the predictor coefficients. The error generated by this estimate is
- <math>e(n) = x(n) - \hat{x}(n)<math>
where <math>x_n<math> is the true signal value.
These equations are valid for all types of (one-dimensional) linear prediction. The differences are found in the way the parameters <math>a_i<math> are chosen.
For multi-dimensional signals the error metric is often defined as
- <math>e(n) = ||x(n) - \hat{x}(n)||<math>
where <math>||.||<math> is a suitable chosen vector norm.
Estimating the parameters
The most common choice in optimisation of parameters <math>a_i<math> is the root mean square criterion which is also called the autocorrelation criterion. In this method we minimise the expected value of the squared error E[e2(n)], which yields the equation
- <math>\sum_{i=1}^p a_i R(i-j) = -R(j),<math>
for 1 ≤ j ≤ p, where R is the autocorrelation of signal xn, defined as
- <math>R(i) = E\{x(n)x(n-i)\}<math>
In the multi-dimensional case this corresponds to minimising the L2 norm.
The above equations are called the normal equations or Yule-Walker equations. In matrix form the equations can be equivalently written as
- <math>Ra = r,<math>
where the autocorrelation matrix R is a symmetric, circulant matrix with elements ri,j = R(i − j), vector r is the autocorrelation vector rj = R(j), and vector a is the parameter vector.
Another more general approach is to minimise
- <math>e(n) = x(n) - \hat{x}(n) = x(n) + \sum_{i=1}^p a_i x(n-i) = \sum_{i=0}^p a_i x(n-i)<math>
where we usually constrain the parameters <math>a_i<math> with <math>a_0=1<math> to avoid the trivial solution. This constraint yields the same predictor as above but the normal equations are then
- <math>\,\!Ra = [1, 0, ... , 0]^T<math>
where the index i ranges from 0 to p and the size of R is a (p+1) × (p+1) matrix.
Optimisation of the parameters is a wide topic and a large number of other approaches have been proposed.
Still, the autocorrelation method is the most common and it is used, for example, for speech coding in the GSM standard.
Solution of the matrix equation Ra = r is computationally a relatively expensive process. The Gauss algorithm for matrix inversion is probably the oldest solution but this approach does not efficiently use the symmetry of R and r. A faster algorithm is the Levinson recursion proposed by Norman Levinson in 1947, which recursively calculates the solution. Later, Delsarte et al proposed an improvement to this algorithm called the split Levinson recursion which requires about half the number of multiplications and divisions. It uses a special symmetrical property of parameter vectors on subsequent recursion levels.