Spatial audio reconstruction for VR applications using a combined method based on SIRR and RSAO approaches
Alinaghi, Atiyeh
69c051f1-9b47-4c47-b9e3-52fa3079f9a3
Remaggi, Luca
c74406cb-15d2-4575-b086-97b55421649e
Kim, Hansung
2c7c135c-f00b-4409-acb2-85b3a9e8225f
Alinaghi, Atiyeh, Remaggi, Luca and Kim, Hansung (2023) Spatial audio reconstruction for VR applications using a combined method based on SIRR and RSAO approaches. In 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP). IEEE. 6 pp. (doi:10.1109/MMSP59012.2023.10337683).
Record type: Conference or Workshop Item (Paper)
Abstract
To recreate a sound field that is as realistic and immersive as the original setting, it is important to preserve the properties of the room acoustics. Room acoustics are usually captured by room impulse responses (RIRs), which can be used to generate sound that can lead to the same auditory perception. In this paper, we compare and combine two main parametric approaches, Spatial Impulse Response Rendering (SIRR) and Reverberant Spatial Audio Object (RSAO), to encode and render the RIRs. We first discuss how SIRR synthesises the early reflections more precisely, whereas RSAO is better suited to rendering the reverberation. In our proposed combined method, each RIR is divided into three parts: the direct sound, early reflections, and late reverberation, as in RSAO. To estimate the boundary time between the early reflections and the late reverberation, we employ the diffuseness factor as in SIRR and set the mixing time to the time at which the diffuseness factor reaches a threshold. The early part is then analysed and synthesised by SIRR, while the reverberant tail is encoded and rendered using RSAO. We show that, in general, the combined method benefits from the strengths of the two baseline methods.
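The mixing-time step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a first-order (B-format-like) RIR whose channels `w`, `x`, `y`, `z` are scaled so that a single plane wave gives equal pressure and velocity magnitudes, and it uses a simplified broadband, time-domain diffuseness estimate (one minus twice the intensity magnitude over the energy). The function names, the window and hop sizes, the 0.9 threshold and the `direct_end` index are all hypothetical; the record does not state the values used in the paper.

```python
import math


def diffuseness_profile(w, x, y, z, win=64, hop=32):
    """Short-time diffuseness estimate per frame, in [0, 1].

    Simplified convention (an assumption, not taken from the paper):
    psi = 1 - 2 * ||sum(w * [x, y, z])|| / sum(w^2 + x^2 + y^2 + z^2).
    """
    psi = []
    for start in range(0, len(w) - win + 1, hop):
        ix = iy = iz = energy = 0.0
        for n in range(start, start + win):
            ix += w[n] * x[n]
            iy += w[n] * y[n]
            iz += w[n] * z[n]
            energy += w[n] ** 2 + x[n] ** 2 + y[n] ** 2 + z[n] ** 2
        if energy == 0.0:
            # Silent frame: report 0 so leading silence never triggers the
            # threshold (an arbitrary choice made for this sketch).
            psi.append(0.0)
        else:
            intensity = math.sqrt(ix * ix + iy * iy + iz * iz)
            psi.append(max(0.0, 1.0 - 2.0 * intensity / energy))
    return psi


def mixing_time_index(psi, hop, threshold=0.9):
    """Sample index of the first frame whose diffuseness reaches the threshold."""
    for frame, value in enumerate(psi):
        if value >= threshold:
            return frame * hop
    return None  # threshold never reached


def split_rir(rir, direct_end, mixing_idx):
    """Split one channel into direct sound, early reflections and late tail."""
    return rir[:direct_end], rir[direct_end:mixing_idx], rir[mixing_idx:]


# Toy signal: 128 samples of a single plane wave along +x (fully directional),
# followed by 128 samples whose net intensity cancels (fully diffuse here).
w = [1.0] * 256
x = [1.0] * 128 + [(-1.0) ** n for n in range(128, 256)]
y = [0.0] * 256
z = [0.0] * 256

psi = diffuseness_profile(w, x, y, z, win=64, hop=32)
mix = mixing_time_index(psi, hop=32, threshold=0.9)
direct, early, late = split_rir(w, direct_end=16, mixing_idx=mix)
print(mix, len(direct), len(early), len(late))
```

In the combined scheme the abstract outlines, the `early` segment would then be passed to a SIRR-style analysis and synthesis, and the `late` segment to an RSAO-style reverberation encoder; neither stage is sketched here.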
More information
e-pub ahead of print date: 8 December 2023
Venue - Dates:
2023 IEEE 25th International Workshop on Multimedia Signal Processing, Futuroscope Technopole, Chasseneuil-du-Poitou, France, 2023-09-27 - 2023-09-29
Identifiers
Local EPrints ID: 490281
URI: http://eprints.soton.ac.uk/id/eprint/490281
PURE UUID: 618570de-fe96-4186-a382-f00a9d22ee2a
Catalogue record
Date deposited: 23 May 2024 16:35
Last modified: 24 May 2024 01:58
Contributors
Author: Atiyeh Alinaghi
Author: Luca Remaggi
Author: Hansung Kim