Abstract

Parametric-stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus a small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psychoacoustical principles. The monaural signal can be encoded using any (conventional) audio coder. Experiments show that the parameterized description of spatial properties enables a highly efficient, high-quality stereo audio representation.

Highlights

  • Efficient coding of wideband audio has gained large interest during the last decades

  • In 1907, Lord Rayleigh formulated the duplex theory [25], which states that sound-source localization is facilitated by interaural intensity differences (IIDs) at high frequencies and by interaural time differences (ITDs) at low frequencies

  • This experiment is useful for two reasons: firstly, to verify statements by others on the maximum quality that can be obtained using parametric stereo, secondly, if parametric stereo is included in an audio coder, the maximum overall bit rate at which parametric stereo still leads to a coding gain compared to conventional stereo techniques is in part dependent on the quality limitations induced by the parametric-stereo algorithm

Read more

Summary

INTRODUCTION

Efficient coding of wideband audio has gained large interest during the last decades. Studies indicate only limited success in exploiting coherence using such techniques [9, 10] These results indicate that exploiting cross-channel redundancies, even if the signal model is able to capture relative time delays, does not lead to a large coding gain. It is well known that for high frequencies (typically above 2 kHz), the human auditory system is not sensitive to fine-structure phase differences between the left and right signals in a stereo recording [11, 12] This phenomenon is exploited by a technique referred to as “intensity stereo” [13, 14].

PSYCHOACOUSTIC BACKGROUND
Headphones versus loudspeaker rendering
Mono coding effects
CODER IMPLEMENTATION
FFT-BASED ENCODER
Segmentation
Frequency separation
Parameter extraction
Downmix
Parameter quantization and coding
FFT-BASED DECODER
Ns cos
QMF-BASED DECODER
PERCEPTUAL EVALUATION
Listening test I
Findings
CONCLUSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.