Abstract

Psychoacoustic models of human auditory perception have found an important application in perceptual audio coding, where exploiting the limitations of perception and removing irrelevance are key to achieving a significant reduction in bitrate while preserving subjective audio quality. To this end, psychoacoustic models do not need to be perfect to serve their purpose, and in fact the commonly employed models represent only a small subset of the known properties and abilities of the human auditory system. This paper provides a tutorial introduction to the most commonly used psychoacoustic models for low-bitrate perceptual audio coding.

Highlights

  • Psychoacoustic models of human auditory perception have found an important application in perceptual audio coding, where exploiting the limitations of perception and reducing irrelevance are key to achieving a significant reduction in bitrate while preserving subjective audio quality, even at very high data compression factors

  • Perceptual audio coders such as Moving Picture Experts Group (MPEG)-2/4 Advanced Audio Coding (AAC) [1,2] deliver high-quality stereo at bitrates of 96 kbit/s, which corresponds to a data reduction factor of about 15 compared to Compact Disc (CD) audio sampled at 44.1 kHz with 16 bits per sample

  • Compared to previous models used for audio coding, the model includes a number of advanced aspects, such as using internal representations rather than difference or noise signals, modulation perception, Comodulation Masking Release (CMR), and asymmetry of perception, as it is motivated by auditory scene analysis [83]
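
The data reduction factor for 96 kbit/s stereo mentioned in the highlights can be checked with a short calculation. The sketch below is illustrative only: it assumes the standard CD format of two channels at 16 bits per sample, and the function names are hypothetical, not taken from the paper.

```python
# Assumed CD parameters (standard CD audio): 44.1 kHz, 16 bits, 2 channels.
SAMPLE_RATE_HZ = 44_100
BITS_PER_SAMPLE = 16
CHANNELS = 2


def cd_bitrate_kbits(sample_rate_hz=SAMPLE_RATE_HZ,
                     bits_per_sample=BITS_PER_SAMPLE,
                     channels=CHANNELS):
    """Uncompressed PCM bitrate in kbit/s."""
    return sample_rate_hz * bits_per_sample * channels / 1000


def data_reduction_factor(coded_bitrate_kbits):
    """Ratio of the uncompressed CD bitrate to the coded bitrate."""
    return cd_bitrate_kbits() / coded_bitrate_kbits


if __name__ == "__main__":
    print(f"CD bitrate: {cd_bitrate_kbits():.1f} kbit/s")
    print(f"Reduction factor at 96 kbit/s: {data_reduction_factor(96):.1f}")
```

Under these assumptions the uncompressed bitrate is 1411.2 kbit/s, so coding at 96 kbit/s gives a factor of roughly 14.7, which rounds to about 15.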


Summary

Introduction


Monaural
Properties of Monaural Human Hearing
Critical
Illustration
Classic
High Efficiency Models for Perceptual Audio Coding
Binaural Hearing
Models for Coding of Stereo
Generalization to Coding of Multi-Channel Audio
Models for Parametric Audio Coding
Parametric Spatial Audio Coding
Some Recent Developments
Summary and Conclusions