University of Southampton Institutional Repository

Sparse, l1-optimal multi-loudspeaker panning and its relation to vector base amplitude panning

Franck, Andreas, Wang, Wenwu and Fazi, Filippo (2017) Sparse, l1-optimal multi-loudspeaker panning and its relation to vector base amplitude panning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (5), pp. 996-1010. (doi:10.1109/TASLP.2017.2674975).

Record type: Article

Abstract

Panning techniques, such as vector base amplitude panning (VBAP), are a widely used practical approach for spatial sound reproduction using multiple loudspeakers. Although limited to a relatively small listening area, they are very efficient and offer good localisation accuracy, timbral quality, and a graceful degradation of quality outside the sweet spot. The aim of this paper is to investigate optimal sound reproduction techniques that adopt some of the advantageous properties of VBAP, such as the sparsity and the locality of the active loudspeakers for the reproduction of a single audio object. To this end, we state the task of multi-loudspeaker panning as an l1 optimization problem. We demonstrate and prove that the resulting solutions are exactly sparse. Moreover, we show the effect of adding a nonnegativity constraint on the loudspeaker gains in order to preserve the locality of the panning solution. With this constraint added, l1-optimal panning can be formulated as a linear program. Using this representation, we prove that unique l1-optimal panning solutions incorporating a nonnegativity constraint are identical to VBAP using a Delaunay triangulation for the loudspeaker setup. Using results from linear programming and duality theory, we describe properties and special cases, such as solution ambiguity, of the VBAP solution.
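The linear-programming view described in the abstract can be illustrated with a small sketch. This is not the authors' code: it assumes a 2D (horizontal-only) loudspeaker layout, unit-length direction vectors, and a formulation of the form "minimise the sum of gains subject to the gains reproducing the source direction and being nonnegative". The function names (`unit`, `solve_pair`, `l1_nonneg_panning`) and the five-loudspeaker layout are hypothetical. Instead of calling an LP solver, the sketch enumerates all loudspeaker pairs, which is sufficient here because a linear program with an optimum attains it at a basic feasible solution.

```python
import math
from itertools import combinations


def unit(deg):
    """Unit direction vector for an azimuth given in degrees."""
    a = math.radians(deg)
    return (math.cos(a), math.sin(a))


def solve_pair(l1, l2, p):
    """Solve g1*l1 + g2*l2 = p by Cramer's rule; None if l1, l2 are collinear."""
    det = l1[0] * l2[1] - l2[0] * l1[1]
    if abs(det) < 1e-12:
        return None
    g1 = (p[0] * l2[1] - l2[0] * p[1]) / det
    g2 = (l1[0] * p[1] - p[0] * l1[1]) / det
    return g1, g2


def l1_nonneg_panning(speakers_deg, source_deg):
    """Minimise sum(g) subject to sum_i g_i * l_i = p and g >= 0 (assumed form).

    Brute-force stand-in for an LP solver: every pair of loudspeakers is a
    candidate basis, and the feasible pair with the smallest gain sum wins.
    """
    p = unit(source_deg)
    vecs = [unit(d) for d in speakers_deg]
    best_cost, best_gains = math.inf, None
    for i, j in combinations(range(len(vecs)), 2):
        sol = solve_pair(vecs[i], vecs[j], p)
        if sol is None or min(sol) < -1e-12:
            continue  # collinear pair or a negative gain: not feasible
        cost = sum(sol)
        if cost < best_cost:
            gains = [0.0] * len(vecs)
            gains[i], gains[j] = sol
            best_cost, best_gains = cost, gains
    return best_gains


# Hypothetical 5-loudspeaker layout (azimuths in degrees), source at 15 degrees.
layout = [0, 30, -30, 110, -110]
gains = l1_nonneg_panning(layout, 15)
active = [layout[k] for k, g in enumerate(gains) if g > 1e-9]
print(active, [round(g, 4) for g in gains])
```

For this layout, only the two loudspeakers adjacent to the source direction (0 and 30 degrees) receive nonzero gain, and the gains match what pairwise 2D VBAP would give for that pair. The 3D case with a Delaunay triangulation treated in the paper would need a proper LP solver and triangulation routine, which this sketch does not attempt.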

PDF franck_wang_fazi2017_sparse_l1_optimal_panning - Author's Original (943kB). Available under License Other.
PDF 07864372 - Accepted Manuscript (625kB).
PDF franck_wang_fazi2017_sparse_l1-optimal_multiloudspeaker_panning_and_its_equivalence_to_vector_based_amplitude_panning - Version of Record (1MB). Available under License Creative Commons Attribution.

More information

Accepted/In Press date: 7 February 2017
e-pub ahead of print date: 24 February 2017
Additional Information: All code and data underlying the findings are fully available without restriction. Details of the data and how to request access are available through the DOI: doi.org/10.15126/surreydata.00813551
Keywords: spatial sound reproduction, amplitude panning, VBAP, sparsity, l1 optimization, compressive sampling, linear programming
Organisations: Acoustics Group

Identifiers

Local EPrints ID: 406196
URI: http://eprints.soton.ac.uk/id/eprint/406196
PURE UUID: cfaf88ec-0b1f-4d7a-b335-fa440cdf31de
ORCID for Andreas Franck: orcid.org/0000-0002-4707-6710

Catalogue record

Date deposited: 10 Mar 2017 10:41
Last modified: 24 Jul 2017 16:35

Contributors

Author: Andreas Franck
Author: Wenwu Wang
Author: Filippo Fazi
