The University of Southampton
University of Southampton Institutional Repository

Acoustic Room Modelling using 360 Stereo Cameras

Acoustic Room Modelling using 360 Stereo Cameras
Acoustic Room Modelling using 360 Stereo Cameras
In this paper we propose a pipeline for estimating acoustic 3D room structure with geometry and attribute prediction using spherical 360 cameras. Instead of setting microphone arrays with loudspeakers to measure acoustic parameters for specific rooms, a simple and practical single-shot capture of the scene using a stereo pair of 360 cameras can be used to simulate those acoustic parameters. We assume that the room and objects can be represented as cuboids aligned to the main axes of the room coordinate (Manhattan world). The scene is captured as a stereo pair using off-the-shelf consumer spherical 360 cameras. A cuboid-based 3D room geometry model is estimated by correspondence matching between captured images and semantic labelling using a convolutional neural network (SegNet). The estimated geometry is used to produce frequency-dependent acoustic predictions of the scene. This is, to our knowledge, the first attempt in the literature to use visual geometry estimation and object classification algorithms to predict acoustic properties. Results are compared to measurements through calculated reverberant spatial audio object parameters used for reverberation reproduction customized to the given loudspeaker set up.
1520-9210
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Fowler, Sam
c302040a-18d7-41b9-ba0c-ec70396c18ad
Jackson, Philip J. B.
01e45068-e098-486c-85ad-4333f4f0a33f
Hilton, Adrian
12782a55-4c4d-4dfb-a690-62505f6665db
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Fowler, Sam
c302040a-18d7-41b9-ba0c-ec70396c18ad
Jackson, Philip J. B.
01e45068-e098-486c-85ad-4333f4f0a33f
Hilton, Adrian
12782a55-4c4d-4dfb-a690-62505f6665db

Kim, Hansung, Remaggi, Luca, Fowler, Sam, Jackson, Philip J. B. and Hilton, Adrian (2020) Acoustic Room Modelling using 360 Stereo Cameras. IEEE Transactions on Multimedia. (doi:10.1109/TMM.2020.3037537).

Record type: Article

Abstract

In this paper we propose a pipeline for estimating acoustic 3D room structure with geometry and attribute prediction using spherical 360 cameras. Instead of setting microphone arrays with loudspeakers to measure acoustic parameters for specific rooms, a simple and practical single-shot capture of the scene using a stereo pair of 360 cameras can be used to simulate those acoustic parameters. We assume that the room and objects can be represented as cuboids aligned to the main axes of the room coordinate (Manhattan world). The scene is captured as a stereo pair using off-the-shelf consumer spherical 360 cameras. A cuboid-based 3D room geometry model is estimated by correspondence matching between captured images and semantic labelling using a convolutional neural network (SegNet). The estimated geometry is used to produce frequency-dependent acoustic predictions of the scene. This is, to our knowledge, the first attempt in the literature to use visual geometry estimation and object classification algorithms to predict acoustic properties. Results are compared to measurements through calculated reverberant spatial audio object parameters used for reverberation reproduction customized to the given loudspeaker set up.

Text
Acoustic Room Modelling using 360 Stereo Cameras - Accepted Manuscript
Download (10MB)

More information

Accepted/In Press date: 29 October 2020
e-pub ahead of print date: 16 November 2020

Identifiers

Local EPrints ID: 447718
URI: http://eprints.soton.ac.uk/id/eprint/447718
ISSN: 1520-9210
PURE UUID: e7df05bd-13f3-4417-8d98-ee5a43dfbf37
ORCID for Hansung Kim: ORCID iD orcid.org/0000-0003-4907-0491

Catalogue record

Date deposited: 18 Mar 2021 17:54
Last modified: 17 Mar 2024 04:01

Export record

Altmetrics

Contributors

Author: Hansung Kim ORCID iD
Author: Luca Remaggi
Author: Sam Fowler
Author: Philip J. B. Jackson
Author: Adrian Hilton

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×