MPEG Standards for Compressed Representation of Immersive Audio

Schuyler R Quackenbush,Jurgen Herre

doi:10.1109/jproc.2021.3075390

Schuyler R Quackenbush, Jurgen Herre

Open Access

PDF Available

https://doi.org/10.1109/jproc.2021.3075390

Copy DOI

Export

Save

Cite

Journal: Proceedings of the IEEE	Publication Date: Sep 1, 2021
Citations: 14	License type: CC BY 4.0

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

The term “immersive audio” is frequently used to describe an audio experience that provides the listener the sensation of being fully immersed or “present” in a sound scene. This can be achieved via different presentation modes, such as surround sound (several loudspeakers horizontally arranged around the listener), 3D audio (with loudspeakers at, above, and below listener ear level), and binaural audio to headphones. This article provides an overview of two recent standards that support the bitrate-efficient carriage of high-quality immersive sound. The first is MPEG-H 3D audio, which is a versatile standard that supports multiple immersive sound signal formats (channels, objects, and higher order ambisonics) and is now being adopted in broadcast and streaming applications. The second is MPEG-I immersive audio, an extension of 3D audio, currently under development, which is targeted for virtual and augmented reality applications. This will support rendering of fully user-interactive immersive sound for three degrees of user movement [three degrees of freedom (3DoF)], i.e., yaw, pitch, and roll head movement, and for six degrees of user movement [six degrees of freedom (6DoF)], i.e., 3DoF plus translational x, y, and z user position movements.

Highlights

The term “immersive audio” is often used to characterize the latest generation of sound systems that aim at providing an audio experience that conveys to the listener the sensation of being fully immersed into or “present” in a surrounding sound scene
The main part of the incoming Motion Picture Experts Group (MPEG)-H 3D audio bitstream is decoded by the core decoder that reproduces the encoded waveforms that represent either channel signals, object signals, or higher order ambisonics (HOA) coefficient signals
In MPEG-I, the user can move around in the world created by the media presentation, with head movement or both head movement and body movement in virtual space, where we assume that audio presentation is done via headphones

Summary

INTRODUCTION

The term “immersive audio” is often used to characterize the latest generation of sound systems that aim at providing an audio experience that conveys to the listener the sensation of being fully immersed into or “present” in a surrounding sound scene. While early sound reproduction systems provided stereophonic sound reproduction over two loudspeakers with an illusion of left-right (and depth) perception to the listener for a limited frontal sound field [1], [2], the second generation added a 360◦ “surround” experience that extended the presented sound stage to include both to the extreme left and right, as well as sound from behind the listener by adding more loudspeakers from all horizontal directions (e.g., 5.1 and 7.1 [3]–[5]) This already provides a significant degree of user immersion into the sound field. MPEG-H is described in [11]; it is a foundational technology for MPEG-I audio and, requires some description here in order to make this article understood by the reader

M P E G-HAUDIO

Overview and Concepts

Waveform Coding

Format Conversion

Object Rendering

HOA Decoding and Rendering

Performance

M P E G-IIMMERSIVEAUDIO

MPEG-I 3DoF Audio

Requirements for MPEG-I 6DoF Audio

Developing MPEG-I 6DoF Immersive Audio

CONCLUSION

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

MPEG Standards for Compressed Representation of Immersive Audio

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Proceedings of the IEEE

Lead the way for us

Similar Papers

EVALUATION OF VIRTUAL REALITY AND AUGMENTED REALITY FOR TEACHING THE LESSON OF GEOMETRIC SOLIDS TO PRIMARY SCHOOL CHILDREN
Eleni Demitriadou ... Andreas Lanitis
-
Eleni Demitriadou, et. al.Eleni Demitriadou ... Andreas Lanitis
01 Jul 2019
01 Jul 2019

A review of the application of virtual and augmented reality in physical and occupational therapy
Agrawal Luckykumar Dwarkadas ... Rama Krishna Challa
Software: Practice and Experience | VOL. -
Agrawal Luckykumar Dwarkadas, et. al.Agrawal Luckykumar Dwarkadas ... Rama Krishna Challa
02 Mar 2024
Software: Practice and Experience | VOL. -

Fast HRFT measurement system with unconstrained head movements for 3D audio in virtual and augmented reality applications
Nguyen Duy Hai ... Woon-Seng Gan
-
Nguyen Duy Hai, et. al.Nguyen Duy Hai ... Woon-Seng Gan
01 Mar 2017
01 Mar 2017

EFFECTIVENESS ON TRAINING METHOD USING VIRTUAL REALITY AND AUGMENTED REALITY APPLICATIONS IN AUTOMOBILE ENGINE ASSEMBLY
Lai Lai Win ... Norhisham Seyajah
ASEAN Engineering Journal | VOL. 12
Lai Lai Win, et. al.Lai Lai Win ... Norhisham Seyajah
29 Nov 2022
ASEAN Engineering Journal | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

MPEG Standards for Compressed Representation of Immersive Audio

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Proceedings of the IEEE