The University of Southampton
University of Southampton Institutional Repository

Sparse, l1-optimal multi-loudspeaker panning and its relation to vector base amplitude panning


Franck, Andreas, Wang, Wenwu and Fazi, Filippo (2017) Sparse, l1-optimal multi-loudspeaker panning and its relation to vector base amplitude panning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (5), 996-1010. (doi:10.1109/TASLP.2017.2674975).

Record type: Article

Abstract

Panning techniques, such as vector base amplitude panning (VBAP), are a widely used practical approach for spatial sound reproduction using multiple loudspeakers. Although limited to a relatively small listening area, they are very efficient and offer good localisation accuracy, timbral quality as well as a graceful degradation of quality outside the sweet spot. The aim of this paper is to investigate optimal sound reproduction techniques that adopt some of the advantageous properties of VBAP, such as the sparsity and the locality of the active loudspeakers for the reproduction of a single audio object. To this end, we state the task of multi-loudspeaker panning as an l1 optimization problem. We demonstrate and prove that the resulting solutions are exactly sparse. Moreover, we show the effect of adding a nonnegativity constraint on the loudspeaker gains in order to preserve the locality of the panning solution. Adding this constraint, l1-optimal panning can be formulated as a linear program. Using this representation, we prove that unique l1-optimal panning solutions incorporating a nonnegativity constraint are identical to VBAP using a Delaunay triangulation for the loudspeaker setup. Using results from linear programming and duality theory, we describe properties and special cases, such as solution ambiguity, of the VBAP solution.
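As an illustration of the linear-program view described in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a two-dimensional setup with loudspeakers on the unit circle, and it takes the problem to be: minimise the sum of the gains subject to the gains being nonnegative and the gain-weighted loudspeaker direction vectors adding up to the source direction vector. Because a bounded linear program attains its optimum at a vertex, and there are only two equality constraints here, this tiny instance can be solved by enumerating all loudspeaker pairs instead of calling an LP solver. The function names, the five-loudspeaker layout and the source azimuth are hypothetical choices made for this example.

```python
import math
from itertools import combinations


def unit(az_deg):
    """Unit direction vector for an azimuth given in degrees."""
    a = math.radians(az_deg)
    return (math.cos(a), math.sin(a))


def pair_gains(l1, l2, p):
    """Solve g1*l1 + g2*l2 = p for one loudspeaker pair (Cramer's rule)."""
    det = l1[0] * l2[1] - l2[0] * l1[1]
    if abs(det) < 1e-12:  # collinear pair, no unique solution
        return None
    g1 = (p[0] * l2[1] - l2[0] * p[1]) / det
    g2 = (l1[0] * p[1] - p[0] * l1[1]) / det
    return g1, g2


def l1_optimal_nonneg_panning(speaker_az_deg, source_az_deg, tol=1e-9):
    """Brute-force the small LP: min sum(g) s.t. sum(g_i * l_i) = p, g >= 0.

    Assumption for this sketch: every vertex of the feasible set has at
    most two nonzero gains, so checking all pairs finds the optimum.
    Returns (cost, gains) or None if no nonnegative pair solution exists.
    """
    speakers = [unit(a) for a in speaker_az_deg]
    p = unit(source_az_deg)
    best = None
    for i, j in combinations(range(len(speakers)), 2):
        sol = pair_gains(speakers[i], speakers[j], p)
        if sol is None or min(sol) < -tol:
            continue  # singular pair or a negative gain: not feasible
        cost = sol[0] + sol[1]
        if best is None or cost < best[0] - tol:
            gains = [0.0] * len(speakers)
            gains[i], gains[j] = max(sol[0], 0.0), max(sol[1], 0.0)
            best = (gains[i] + gains[j], gains)
    return best


# Hypothetical 5-loudspeaker layout (azimuths in degrees), source at 15 degrees.
layout = [0, 30, -30, 110, -110]
cost, gains = l1_optimal_nonneg_panning(layout, 15)
print([round(g, 4) for g in gains], round(cost, 4))
```

In this example only the two loudspeakers adjacent to the source direction receive nonzero gain, which is the pair that pairwise (2-D) VBAP would select. The power normalisation that VBAP implementations typically apply afterwards is omitted here.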

Text: franck_wang_fazi2017_sparse_l1_optimal_panning - Author's Original (943kB). Available under License Other.
Text: 07864372 - Accepted Manuscript (625kB).
Text: franck_wang_fazi2017_sparse_l1-optimal_multiloudspeaker_panning_and_its_equivalence_to_vector_based_amplitude_panning - Version of Record (1MB). Available under License Creative Commons Attribution.

More information

Accepted/In Press date: 7 February 2017
e-pub ahead of print date: 24 February 2017
Additional Information: All code and data underlying the findings are fully available without restriction. Details of the data and how to request access are available through the DOI: doi.org/10.15126/surreydata.00813551
Keywords: spatial sound reproduction, amplitude panning, VBAP, sparsity, l1 optimization, compressive sampling, linear programming
Organisations: Acoustics Group

Identifiers

Local EPrints ID: 406196
URI: http://eprints.soton.ac.uk/id/eprint/406196
PURE UUID: cfaf88ec-0b1f-4d7a-b335-fa440cdf31de
ORCID for Andreas Franck: orcid.org/0000-0002-4707-6710
ORCID for Filippo Fazi: orcid.org/0000-0003-4129-1433

Catalogue record

Date deposited: 10 Mar 2017 10:41
Last modified: 13 Jun 2024 01:46


Contributors

Author: Andreas Franck
Author: Wenwu Wang
Author: Filippo Fazi
