Comparison of Linear Prediction Models for Audio Signals

Toon Van Waterschoot, Marc Moonen

doi:10.1155/2008/706935

Abstract

While linear prediction (LP) has become immensely popular in speech modeling, it does not seem to provide a good approach for modeling audio signals. This is somewhat surprising, since a tonal signal consisting of a number of sinusoids can be perfectly predicted based on an (all-pole) LP model with a model order that is twice the number of sinusoids. We provide an explanation why this result cannot simply be extrapolated to LP of audio signals. If noise is taken into account in the tonal signal model, a low-order all-pole model appears to be only appropriate when the tonal components are uniformly distributed in the Nyquist interval. Based on this observation, different alternatives to the conventional LP model can be suggested. Either the model should be changed to a pole-zero, a high-order all-pole, or a pitch prediction model, or the conventional LP model should be preceded by an appropriate frequency transform, such as a frequency warping or downsampling. By comparing these alternative LP models to the conventional LP model in terms of frequency estimation accuracy, residual spectral flatness, and perceptual frequency resolution, we obtain several new and promising approaches to LP-based audio modeling.
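The perfect-prediction property stated in the abstract can be checked numerically. The following is a minimal sketch, assuming a noiseless sum of K real sinusoids whose normalized frequencies are known in advance; the function names `sinusoid_predictor` and `prediction_residual`, the frequencies, and the phases are illustrative choices and are not taken from the paper.

```python
import numpy as np

def sinusoid_predictor(omegas):
    """Prediction-error filter [1, c1, ..., c2K] with zeros at exp(+/- j*omega_k)."""
    a = np.array([1.0])
    for w in omegas:
        # Each real sinusoid obeys x[n] - 2*cos(w)*x[n-1] + x[n-2] = 0.
        a = np.convolve(a, [1.0, -2.0 * np.cos(w), 1.0])
    return a

def prediction_residual(x, a):
    """Apply the FIR prediction-error filter, keeping fully overlapped samples only."""
    return np.convolve(x, a, mode="valid")

# Assumed example: K = 3 sinusoids, frequencies in rad/sample, arbitrary phases.
omegas = np.array([0.05, 0.10, 0.15]) * np.pi
n = np.arange(512)
x = sum(np.cos(w * n + 0.3 * k) for k, w in enumerate(omegas))

a = sinusoid_predictor(omegas)           # order 2K = 6, so 7 coefficients
e = prediction_residual(x, a)
max_residual = np.max(np.abs(e))         # zero up to floating-point rounding
```

The sketch only covers the noiseless case. Once noise is added to `x`, the filter built this way no longer minimizes the prediction error, which is the situation the abstract goes on to discuss.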

Highlights

Linear prediction (LP) is a widely used and well-understood technique for the analysis, modeling, and coding of speech signals [1]
In the last two alternative LP models, namely, the warped LP (WLP) model and the selective LP (SLP) model, the performance of the conventional low-order all-pole model is increased by first transforming the input signal such that its tonal components are spread in the entire Nyquist interval
We evaluate the conventional and alternative LP models described in Sections 3 and 4 in terms of frequency estimation accuracy, residual spectral flatness, and perceptual frequency resolution for a synthetic harmonic audio signal with varying fundamental frequency and signal-to-noise ratio (SNR)
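Of the evaluation measures listed above, residual spectral flatness is the simplest to illustrate. The sketch below uses one common definition, the ratio of the geometric mean to the arithmetic mean of the periodogram; whether the paper uses exactly this definition is an assumption, and the function name `spectral_flatness`, the `eps` guard, and the test signals are illustrative.

```python
import numpy as np

def spectral_flatness(e, eps=1e-12):
    """Geometric mean over arithmetic mean of the periodogram of e.

    Values near 1 indicate a flat (white) spectrum, values near 0 a peaky one.
    eps is an assumed small constant that keeps the logarithm finite.
    """
    p = np.abs(np.fft.rfft(e)) ** 2 + eps
    return float(np.exp(np.mean(np.log(p))) / np.mean(p))

rng = np.random.default_rng(0)
n = np.arange(4096)
white = rng.standard_normal(n.size)      # stand-in for a well-whitened residual
tonal = np.cos(0.1 * np.pi * n)          # stand-in for a residual that is still tonal

sfm_white = spectral_flatness(white)
sfm_tonal = spectral_flatness(tonal)
```

A residual that still contains tonal components scores much lower than a noise-like residual, which is why the measure is useful for comparing how well different LP models whiten the signal.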

Summary

INTRODUCTION

Linear prediction (LP) is a widely used and well-understood technique for the analysis, modeling, and coding of speech signals [1]. One could expect that performing LP using a model order that is twice the number of tonal components leads to a signal estimate in which each of the spectral peaks is modeled with a complex conjugate pole pair close to (but inside) the unit circle. This does not seem to be the case, and very often a poor LP signal estimate is obtained. All considered approaches result in stable LP models, and some outperform the WLP model both in terms of conventional measures, such as frequency estimation error and residual spectral flatness [43, Chapter 6], and in terms of perceptually motivated measures, such as interpeak dip depth (IDD) [12]. Many of these alternative models perform even better when cascaded with a conventional LP model.
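The expectation described above, one complex conjugate pole pair per spectral peak, can be explored with a small experiment. The sketch below is not the estimation procedure used in the paper: it assumes a plain least-squares (covariance-method) LP fit and reads frequency estimates off the pole angles. The function names `lp_covariance` and `pole_frequencies`, the frequencies, and the noise level are illustrative assumptions.

```python
import numpy as np

def lp_covariance(x, order):
    """Least-squares LP coefficients a with x[n] ~ sum_i a[i] * x[n-1-i]."""
    N = len(x)
    # Column i holds the signal delayed by i+1 samples.
    X = np.column_stack([x[order - 1 - i : N - 1 - i] for i in range(order)])
    y = x[order:]
    a, *_ = np.linalg.lstsq(X, y, rcond=None)
    return a

def pole_frequencies(a):
    """Angles (rad/sample) of the upper-half-plane poles of 1 / (1 - sum_i a[i] z^-(i+1))."""
    poles = np.roots(np.concatenate(([1.0], -a)))
    return np.sort(np.angle(poles[poles.imag > 0]))

# Assumed example: K = 3 sinusoids, so the model order is 2K = 6.
omegas = np.array([0.10, 0.25, 0.40]) * np.pi
n = np.arange(1024)
x = sum(np.cos(w * n + 0.5 * k) for k, w in enumerate(omegas))

a_clean = lp_covariance(x, 6)
freqs_clean = pole_frequencies(a_clean)  # matches omegas in the noiseless case

# Same fit with additive white noise (assumed noise level); compare with omegas.
rng = np.random.default_rng(0)
freqs_noisy = pole_frequencies(lp_covariance(x + 0.1 * rng.standard_normal(n.size), 6))
```

In the noiseless case the pole angles recover the true frequencies. Comparing `freqs_noisy` with `omegas` for different noise levels and frequency spacings gives a feel for the degradation the summary describes; no particular error level is claimed here.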

Tonal audio signal model

Linear prediction criterion

CONVENTIONAL LINEAR PREDICTION MODEL

ALTERNATIVE LINEAR PREDICTION MODELS

Constrained pole-zero LP model

High-order LP model

Pitch prediction model

Warped LP Model

Selective LP Model

SIMULATION RESULTS

Synthetic audio signal

Monophonic audio signal

Polyphonic audio signal

CONCLUSION

Journal: Eurasip Journal on Audio, Speech, and Music Processing
Publication Date: Jan 1, 2008
Citations: 26
License type: cc-by
