Room Acoustic Properties Estimation from a Single 360 Photo
Room Acoustic Properties Estimation from a Single 360 Photo
Estimating room impulse responses (RIRs) in real spaces is a time-consuming and expensive process requiring multiple pieces of equipment, recordings, and processing. A simple computer-vision-based method from a single 360◦photo is proposed to estimate the acoustic material properties of the space by reconstructing an approximated 3D geometry. A 3D semantic geometry model is reconstructed from a 360◦image by monocular depth estimation and semantic scene completion. The material properties of semantic objects in the scene are estimated using the transformer-based dense material segmentation method. This model is used to simulate a 3D acoustic room model on the Unity platform with Steam spatial audio plug-in. Acoustic properties of the space are estimated from this virtual reproduction and evaluated against the actual ones in the real environment. Index Terms—3D reconstruction and completion, room acoustic modeling, depth estimation, material estimation.
857
Alawadh, Mona
60613079-426e-425a-81d3-09a6fbb7a92c
Wu, Yihong
2876bede-25f1-47a5-9e08-b98be99b2d31
Heng, Yuwen
a3edf9da-2d3b-450c-8d6d-85f76c861849
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
1 September 2022
Alawadh, Mona
60613079-426e-425a-81d3-09a6fbb7a92c
Wu, Yihong
2876bede-25f1-47a5-9e08-b98be99b2d31
Heng, Yuwen
a3edf9da-2d3b-450c-8d6d-85f76c861849
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Alawadh, Mona, Wu, Yihong, Heng, Yuwen, Remaggi, Luca, Niranjan, Mahesan and Kim, Hansung
(2022)
Room Acoustic Properties Estimation from a Single 360 Photo.
European conference on signal processing 2022, , Belgrade, Serbia.
29 Aug - 02 Sep 2022.
.
Record type:
Conference or Workshop Item
(Paper)
Abstract
Estimating room impulse responses (RIRs) in real spaces is a time-consuming and expensive process requiring multiple pieces of equipment, recordings, and processing. A simple computer-vision-based method from a single 360◦photo is proposed to estimate the acoustic material properties of the space by reconstructing an approximated 3D geometry. A 3D semantic geometry model is reconstructed from a 360◦image by monocular depth estimation and semantic scene completion. The material properties of semantic objects in the scene are estimated using the transformer-based dense material segmentation method. This model is used to simulate a 3D acoustic room model on the Unity platform with Steam spatial audio plug-in. Acoustic properties of the space are estimated from this virtual reproduction and evaluated against the actual ones in the real environment. Index Terms—3D reconstruction and completion, room acoustic modeling, depth estimation, material estimation.
Text
EUSIPCO-CameraReady
- Accepted Manuscript
More information
Published date: 1 September 2022
Venue - Dates:
European conference on signal processing 2022, , Belgrade, Serbia, 2022-08-29 - 2022-09-02
Identifiers
Local EPrints ID: 470328
URI: http://eprints.soton.ac.uk/id/eprint/470328
PURE UUID: 3f5c5d80-2c35-432c-b0ee-e42f41c4d29e
Catalogue record
Date deposited: 06 Oct 2022 16:51
Last modified: 17 Mar 2024 04:04
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics