Acoustic Room Modelling Using 360 Stereo Cameras

Hansung Kim,Philip Jb Jackson,Sam Fowler,Adrian Hilton,Luca Remaggi

doi:10.1109/tmm.2020.3037537

Hansung Kim, Philip Jb Jackson + Show 3 more

Open Access

https://doi.org/10.1109/tmm.2020.3037537

Copy DOI

Abstract

In this paper we propose a pipeline for estimating acoustic 3D room structure with geometry and attribute prediction using spherical 360 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$^{\circ }$</tex-math></inline-formula> cameras. Instead of setting microphone arrays with loudspeakers to measure acoustic parameters for specific rooms, a simple and practical single-shot capture of the scene using a stereo pair of 360 cameras can be used to simulate those acoustic parameters. We assume that the room and objects can be represented as cuboids aligned to the main axes of the room coordinate (Manhattan world). The scene is captured as a stereo pair using off-the-shelf consumer spherical 360 cameras. A cuboid-based 3D room geometry model is estimated by correspondence matching between captured images and semantic labelling using a convolutional neural network (SegNet). The estimated geometry is used to produce frequency-dependent acoustic predictions of the scene. This is, to our knowledge, the first attempt in the literature to use visual geometry estimation and object classification algorithms to predict acoustic properties. Results are compared to measurements through calculated reverberant spatial audio object parameters used for reverberation reproduction customized to the given loudspeaker set up.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Acoustic Room Modelling Using 360 Stereo Cameras

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Journal: IEEE Transactions on Multimedia	Publication Date: Nov 12, 2020
Citations: 3

Similar Papers

Semantic Labeling of Aerial and Satellite Imagery
Sakrapee Paisitkriangkrai ... Pranam Janney
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 9
Sakrapee Paisitkriangkrai, et. al.Sakrapee Paisitkriangkrai ... Pranam Janney
01 Jul 2016
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 9

Strong-Structural Convolution Neural Network for Semantic Segmentation
Yi Ouyang
Pattern Recognition and Image Analysis | VOL. 29
Yi OuyangYi Ouyang
01 Oct 2019
Pattern Recognition and Image Analysis | VOL. 29

High-Resolution Aerial Image Labeling With Convolutional Neural Networks
Emmanuel Maggiori ... Yuliya Tarabalka
IEEE Transactions on Geoscience and Remote Sensing | VOL. 55
Emmanuel Maggiori, et. al.Emmanuel Maggiori ... Yuliya Tarabalka
01 Dec 2017
IEEE Transactions on Geoscience and Remote Sensing | VOL. 55

Author response: Invariant representation of physical stability in the human brain
RT Pramod ... Joshua B Tenenbaum
-
RT Pramod, et. al.RT Pramod ... Joshua B Tenenbaum
09 Feb 2022
09 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic Room Modelling Using 360 Stereo Cameras

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia