Combining background noise and artificial masking to achieve privacy in sound zones
Combining background noise and artificial masking to achieve privacy in sound zones
A private sound zone can be created by focusing a spoken message towards a target listener using a loudspeaker array. In practice, however, the reproduced speech cannot be completely contained within the target zone due to practical limits on the directivity of the array. Despite these limitations, the privacy of the message can be maintained if the leaked speech is sufficiently masked by noise. Two possible sources of this masking noise are considered in this article: the ambient noise in the reproduction environment, and an additional masking signal radiated by the loudspeaker array. The present article demonstrates that the process of designing a private audio system is significantly affected by the presence of ambient noise. A key complication is that temporal fluctuations and spatial non-uniformity in the ambient noise can reduce its effectiveness as a masker. These features also make it more difficult to estimate the corresponding reduction in the intelligibility of speech in each listening zone. To mitigate this spatial and temporal variance, it is proposed that systems should be designed to rely only on the masking provided by the diffuse, quasi-stationary background noise component of the environmental noise. It is shown that when systems utilise a combination of the background noise and an additional, artificial masker, a lower level of acoustic contrast is required from the system, compared to the case where the masking is supplied by the background noise exclusively.
Auditory masking, Signal processing, Sound zones, Speech privacy
Wallace, Daniel
93e478ef-92e9-4293-ad2c-ad851c1a10b8
Cheer, Jordan
8e452f50-4c7d-4d4e-913a-34015e99b9dc
March 2022
Wallace, Daniel
93e478ef-92e9-4293-ad2c-ad851c1a10b8
Cheer, Jordan
8e452f50-4c7d-4d4e-913a-34015e99b9dc
Wallace, Daniel and Cheer, Jordan
(2022)
Combining background noise and artificial masking to achieve privacy in sound zones.
Computer Speech and Language, 72, [101285].
(doi:10.1016/j.csl.2021.101285).
Abstract
A private sound zone can be created by focusing a spoken message towards a target listener using a loudspeaker array. In practice, however, the reproduced speech cannot be completely contained within the target zone due to practical limits on the directivity of the array. Despite these limitations, the privacy of the message can be maintained if the leaked speech is sufficiently masked by noise. Two possible sources of this masking noise are considered in this article: the ambient noise in the reproduction environment, and an additional masking signal radiated by the loudspeaker array. The present article demonstrates that the process of designing a private audio system is significantly affected by the presence of ambient noise. A key complication is that temporal fluctuations and spatial non-uniformity in the ambient noise can reduce its effectiveness as a masker. These features also make it more difficult to estimate the corresponding reduction in the intelligibility of speech in each listening zone. To mitigate this spatial and temporal variance, it is proposed that systems should be designed to rely only on the masking provided by the diffuse, quasi-stationary background noise component of the environmental noise. It is shown that when systems utilise a combination of the background noise and an additional, artificial masker, a lower level of acoustic contrast is required from the system, compared to the case where the masking is supplied by the background noise exclusively.
Text
Combining Background Noise-1
- Accepted Manuscript
More information
Accepted/In Press date: 30 August 2021
e-pub ahead of print date: 9 September 2021
Published date: March 2022
Additional Information:
Funding Information:
Daniel Wallace was supported by the EPSRC Centre for Doctoral Training in Next Generation Computational Modelling Grant No. EP/L015382/1.
Publisher Copyright:
© 2021 Elsevier Ltd
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
Keywords:
Auditory masking, Signal processing, Sound zones, Speech privacy
Identifiers
Local EPrints ID: 451889
URI: http://eprints.soton.ac.uk/id/eprint/451889
ISSN: 0885-2308
PURE UUID: 20410c74-bcea-4913-b5a5-00c53b473b5a
Catalogue record
Date deposited: 02 Nov 2021 17:43
Last modified: 12 Nov 2024 05:05
Export record
Altmetrics
Contributors
Author:
Daniel Wallace
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics