Higher order stereophony
Higher order stereophony
This thesis presents the derivation, experimental validation and perceptual evaluation of a new technique for spatial audio reproduction named Higher Order Stereophony. The approach uses the Taylor expansion of a plane wave soundfield to represent the field as a summation of its derivatives. The soundfield is then reproduced correctly along a single axis only, which is assumed to be the listener’s interaural axis, in an attempt to reproduce the correct binaural signals for 3D audio reproduction. A dynamic extension to the technique using head-tracking dynamically adapts the loudspeaker gains to ensure the reproduction axis always aligns with the listener’s interaural axis, regardless of the listener’s orientation.
Higher Order Stereophony is shown to be a generalisation to higher orders of classic stereo techniques such as the sine law, and is a frequency-independent solution resulting in loudspeaker panning gains. This generalisation is similar in manner to Higher Order Ambisonics, and parallels between the two are investigated including the derivation of decoders to transform from both the 3D or 2D Higher Order Ambisonics representation to Higher Order Stereophony, which first require a specific rotation of the soundfield followed by a matrix of gains. The new approach presents an alternative to Higher Order Ambisonics, with advantages in requiring only (N + 1) channels/loudspeakers for N-th order reproduction as well as requiring loudspeakers in front of the listener only.
Higher Order Stereophony is also applied to binaural rendering and shown to fully reproduce binaural signals with a rigid sphere Head-Related Transfer Function, due to the axisymmetric geometry of the scattering in the head model. When representing any Head-Related Transfer Function using spherical harmonics, a specific Higher Order Stereophony rotation is defined to align the interaural axis with the z axis, which is shown to reorder the energy of the spherical harmonic coefficients to those with m close to 0. Higher Order Stereophony is then demonstrated to reproduce all spherical harmonic coefficients of the Head-Related Transfer Function with m = 0 only. Subjective experiments comparing Higher Order Ambisonics and Higher Order Stereophony show that Higher Order Stereophony can perform similarly to Higher Order Ambisonics despite its advantages computationally and regarding loudspeaker positioning.
University of Southampton
Hollebon, Jacob
75e4dd71-cfb5-4d28-82a5-7ee1bee73207
15 March 2023
Hollebon, Jacob
75e4dd71-cfb5-4d28-82a5-7ee1bee73207
Fazi, Filippo
e5aefc08-ab45-47c1-ad69-c3f12d07d807
Hollebon, Jacob
(2023)
Higher order stereophony.
University of Southampton, Doctoral Thesis, 181pp.
Record type:
Thesis
(Doctoral)
Abstract
This thesis presents the derivation, experimental validation and perceptual evaluation of a new technique for spatial audio reproduction named Higher Order Stereophony. The approach uses the Taylor expansion of a plane wave soundfield to represent the field as a summation of its derivatives. The soundfield is then reproduced correctly along a single axis only, which is assumed to be the listener’s interaural axis, in an attempt to reproduce the correct binaural signals for 3D audio reproduction. A dynamic extension to the technique using head-tracking dynamically adapts the loudspeaker gains to ensure the reproduction axis always aligns with the listener’s interaural axis, regardless of the listener’s orientation.
Higher Order Stereophony is shown to be a generalisation to higher orders of classic stereo techniques such as the sine law, and is a frequency-independent solution resulting in loudspeaker panning gains. This generalisation is similar in manner to Higher Order Ambisonics, and parallels between the two are investigated including the derivation of decoders to transform from both the 3D or 2D Higher Order Ambisonics representation to Higher Order Stereophony, which first require a specific rotation of the soundfield followed by a matrix of gains. The new approach presents an alternative to Higher Order Ambisonics, with advantages in requiring only (N + 1) channels/loudspeakers for N-th order reproduction as well as requiring loudspeakers in front of the listener only.
Higher Order Stereophony is also applied to binaural rendering and shown to fully reproduce binaural signals with a rigid sphere Head-Related Transfer Function, due to the axisymmetric geometry of the scattering in the head model. When representing any Head-Related Transfer Function using spherical harmonics, a specific Higher Order Stereophony rotation is defined to align the interaural axis with the z axis, which is shown to reorder the energy of the spherical harmonic coefficients to those with m close to 0. Higher Order Stereophony is then demonstrated to reproduce all spherical harmonic coefficients of the Head-Related Transfer Function with m = 0 only. Subjective experiments comparing Higher Order Ambisonics and Higher Order Stereophony show that Higher Order Stereophony can perform similarly to Higher Order Ambisonics despite its advantages computationally and regarding loudspeaker positioning.
Text
HigherOrderStereophony
- Version of Record
Text
Final-thesis-submission-Examination-Mr-Jacob-Hollebon (2)
Restricted to Repository staff only
More information
Published date: 15 March 2023
Identifiers
Local EPrints ID: 476263
URI: http://eprints.soton.ac.uk/id/eprint/476263
PURE UUID: fac1a1b6-5722-497c-9810-39195d3c7de8
Catalogue record
Date deposited: 18 Apr 2023 16:33
Last modified: 17 Mar 2024 04:09
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics