Abstract

Multichannel audio offers significant advantages for music reproduction, including the ability to provide better localization and envelopment, as well as reduced imaging distortion. On the other hand, multichannel audio is a demanding media type in terms of transmission requirements. Often, bandwidth limitations prohibit transmission of multiple audio channels. In such cases, an alternative is to transmit only one or two reference channels and recreate the rest of the channels at the receiving end. Here, we propose a system capable of synthesizing the required signals from a smaller set of signals recorded in a particular venue. These synthesized virtual microphone signals can be used to produce multichannel recordings that accurately capture the acoustics of that venue. Applications of the proposed system include transmission of multichannel audio over the current Internet infrastructure and, as an extension of the methods proposed here, remastering existing monophonic and stereophonic recordings for multichannel rendering.

Highlights

  • Multichannel audio can enhance the sense of immersion for a group of listeners by reproducing the sounds that would originate from several directions around the listeners, simulating the way we perceive sound in a real acoustical space

  • The proposed method is based on information of the acoustics of a specific concert hall and the microphone locations with respect to the orchestra; this information can be extracted from the specific multichannel recording

  • Instead of warping the frequency spectrum using the Bark scale as is usual in speech analysis, the frequency spectrum was divided in subbands and each one was treated separately under the analysis presented in the previous section

Read more

Summary

INTRODUCTION

Multichannel audio can enhance the sense of immersion for a group of listeners by reproducing the sounds that would originate from several directions around the listeners, simulating the way we perceive sound in a real acoustical space. The result would be a significant reduction in transmission requirements, while enabling mixing at the receiving end Such a system would be suitable for completely resynthesizing any number of channels in the initial recording (i.e., no information about the target recordings needs to be transmitted other than the conversion parameters). The filters designed are capable of recreating the acoustic properties of the venue where the specific recordings took place If these filters are applied to an arbitrary (nonreverberant) recording, the resulting signal will contain the venue characteristics at the particular microphone location. In such manner, it is possible to completely synthesize reverberant stem recordings and, a multichannel recording.

SPOT MICROPHONE RESYNTHESIS
Spectral conversion
SC based on VQ
SC based on GMM
Subband processing
Residual processing for percussive sounds
Implementation details
REVERBERANT MICROPHONE SIGNAL SYNTHESIS
IIR filter design
Mutual information as a spectral distortion measure
CONCLUSIONS AND FUTURE RESEARCH
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.