Immersive virtual reality audio rendering adapted to the listener and the room
Kim, Hansung, Remaggi, Luca, Jackson, P.J.B. and Hilton, Adrian (2020) Immersive virtual reality audio rendering adapted to the listener and the room. In: Magnor, M. and Sorkine-Hornung, A. (eds.) Real VR – Immersive Digital Reality. Lecture Notes in Computer Science, 11900. Cham: Springer, pp. 293-318. (doi:10.1007/978-3-030-41816-8_13)
Record type: Book Section
Abstract
The visual and auditory modalities are the most important stimuli for humans. To maximise the sense of immersion in VR environments, plausible spatial audio reproduction synchronised with the visual information is essential. However, measuring the acoustic properties of an environment with audio equipment is a complicated process. In this chapter, we introduce a simple and efficient system that estimates room acoustics for plausible spatial audio rendering, using 360° cameras to reproduce real scenes in VR. A simplified 3D semantic model of the scene is estimated from the captured images using computer vision algorithms and a convolutional neural network (CNN). Spatially synchronised audio is then reproduced from the estimated geometric and acoustic properties of the scene, and the reconstructed scenes are rendered with the synthesised spatial audio.
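For illustration, the sketch below shows one standard way such estimated geometric and acoustic properties could parameterise an audio renderer: given a room volume and per-surface absorption coefficients, Sabine's formula predicts the RT60 reverberation time. This is a minimal sketch, not the chapter's actual implementation; the room dimensions, surface labels, and absorption values are hypothetical stand-ins for what the vision-based pipeline would estimate.

```python
# Minimal sketch: derive a reverberation time (RT60) from estimated room
# geometry and material absorption, as one plausible input to a spatial
# audio renderer. All values below are hypothetical, standing in for the
# chapter's vision-based estimates.

def rt60_sabine(volume_m3, surfaces):
    """Sabine's formula: RT60 = 0.161 * V / sum(S_i * alpha_i).

    volume_m3: room volume in cubic metres.
    surfaces:  iterable of (area_m2, absorption_coefficient) pairs.
    """
    total_absorption = sum(area * alpha for area, alpha in surfaces)
    return 0.161 * volume_m3 / total_absorption

# Hypothetical 6 x 4 x 3 m room with semantically labelled surfaces,
# e.g. as a CNN might classify them from 360-degree images.
room_volume = 6.0 * 4.0 * 3.0
room_surfaces = [
    (6.0 * 4.0, 0.10),      # carpeted floor
    (6.0 * 4.0, 0.02),      # plaster ceiling
    (2 * 6.0 * 3.0, 0.03),  # painted long walls
    (2 * 4.0 * 3.0, 0.30),  # curtained short walls
]

print(f"Estimated RT60: {rt60_sabine(room_volume, room_surfaces):.2f} s")
```

In practice, the chapter's system estimates the geometry and material classes automatically from the captured images; Sabine's formula is simply one conventional bridge from those estimates to a reverberation parameter.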
More information
e-pub ahead of print date: 3 March 2020
Published date: 2020
Additional Information: Publisher copyright © Springer Nature Switzerland AG 2020.
Keywords: 3D modeling, acoustic properties, audio acoustics, computer vision, convolutional neural networks, room acoustics, sound reproduction, spatial audio, virtual reality
Identifiers
Local EPrints ID: 440611
URI: http://eprints.soton.ac.uk/id/eprint/440611
PURE UUID: 5efe9acf-30fa-4efd-a81e-c8b0a39c8221
Catalogue record
Date deposited: 12 May 2020 16:35
Last modified: 17 Mar 2024 04:01
Contributors
Author: Hansung Kim
Author: Luca Remaggi
Author: P.J.B. Jackson
Author: Adrian Hilton
Editor: M. Magnor
Editor: A. Sorkine-Hornung