Acoustic Room Modelling using 360 Stereo Cameras
Acoustic Room Modelling using 360 Stereo Cameras
In this paper we propose a pipeline for estimating acoustic 3D room structure with geometry and attribute prediction using spherical 360 cameras. Instead of setting microphone arrays with loudspeakers to measure acoustic parameters for specific rooms, a simple and practical single-shot capture of the scene using a stereo pair of 360 cameras can be used to simulate those acoustic parameters. We assume that the room and objects can be represented as cuboids aligned to the main axes of the room coordinate (Manhattan world). The scene is captured as a stereo pair using off-the-shelf consumer spherical 360 cameras. A cuboid-based 3D room geometry model is estimated by correspondence matching between captured images and semantic labelling using a convolutional neural network (SegNet). The estimated geometry is used to produce frequency-dependent acoustic predictions of the scene. This is, to our knowledge, the first attempt in the literature to use visual geometry estimation and object classification algorithms to predict acoustic properties. Results are compared to measurements through calculated reverberant spatial audio object parameters used for reverberation reproduction customized to the given loudspeaker set up.
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Fowler, Sam
c302040a-18d7-41b9-ba0c-ec70396c18ad
Jackson, Philip J. B.
01e45068-e098-486c-85ad-4333f4f0a33f
Hilton, Adrian
12782a55-4c4d-4dfb-a690-62505f6665db
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Fowler, Sam
c302040a-18d7-41b9-ba0c-ec70396c18ad
Jackson, Philip J. B.
01e45068-e098-486c-85ad-4333f4f0a33f
Hilton, Adrian
12782a55-4c4d-4dfb-a690-62505f6665db
Kim, Hansung, Remaggi, Luca, Fowler, Sam, Jackson, Philip J. B. and Hilton, Adrian
(2020)
Acoustic Room Modelling using 360 Stereo Cameras.
IEEE Transactions on Multimedia.
(doi:10.1109/TMM.2020.3037537).
Abstract
In this paper we propose a pipeline for estimating acoustic 3D room structure with geometry and attribute prediction using spherical 360 cameras. Instead of setting microphone arrays with loudspeakers to measure acoustic parameters for specific rooms, a simple and practical single-shot capture of the scene using a stereo pair of 360 cameras can be used to simulate those acoustic parameters. We assume that the room and objects can be represented as cuboids aligned to the main axes of the room coordinate (Manhattan world). The scene is captured as a stereo pair using off-the-shelf consumer spherical 360 cameras. A cuboid-based 3D room geometry model is estimated by correspondence matching between captured images and semantic labelling using a convolutional neural network (SegNet). The estimated geometry is used to produce frequency-dependent acoustic predictions of the scene. This is, to our knowledge, the first attempt in the literature to use visual geometry estimation and object classification algorithms to predict acoustic properties. Results are compared to measurements through calculated reverberant spatial audio object parameters used for reverberation reproduction customized to the given loudspeaker set up.
Text
Acoustic Room Modelling using 360 Stereo Cameras
- Accepted Manuscript
More information
Accepted/In Press date: 29 October 2020
e-pub ahead of print date: 16 November 2020
Identifiers
Local EPrints ID: 447718
URI: http://eprints.soton.ac.uk/id/eprint/447718
ISSN: 1520-9210
PURE UUID: e7df05bd-13f3-4417-8d98-ee5a43dfbf37
Catalogue record
Date deposited: 18 Mar 2021 17:54
Last modified: 17 Mar 2024 04:01
Export record
Altmetrics
Contributors
Author:
Hansung Kim
Author:
Luca Remaggi
Author:
Sam Fowler
Author:
Philip J. B. Jackson
Author:
Adrian Hilton
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics