Anonymizing speaker voices: easy to imitate, difficult to recognize?
Anonymizing speaker voices: easy to imitate, difficult to recognize?
A vastly under-explored area in speech anonymization involves characterizing how different speakers perform in voice privacy tasks. In this paper, we present a deeper analysis by creating and analyzing groups of challenging speakers categorized based on their performance in two related facets of voice anonymization evaluation: (1) speaker similarity using automatic speaker verification (ASV) and (2) human perception using a large-scale A/B listening test. We group speakers into four categories (sheep, goats, lambs, and wolves) based on their anonymization properties. We present an extension of voice anonymization evaluation by identifying speakers who are easy to imitate or difficult to recognize. This knowledge is important for trustworthy anonymization evaluation, and it has the potential to influence how evaluation datasets are created from a pool of speakers. We provide further insights on speaker influence on anonymized speech between human perception and automatic speaker similarity scoring.
12491-12495
Williams, Jennifer
3a1568b4-8a0b-41d2-8635-14fe69fbb360
Pizzi, Karla
fb24c310-1e1b-4961-b365-910ab4412a3f
Tomashenko, Natalia
fab2f42d-ab47-4b2b-90a1-b9b2d0dc4591
Das, Sneha
0857b93b-9275-49c2-9c0d-c193fd15ffb3
18 March 2024
Williams, Jennifer
3a1568b4-8a0b-41d2-8635-14fe69fbb360
Pizzi, Karla
fb24c310-1e1b-4961-b365-910ab4412a3f
Tomashenko, Natalia
fab2f42d-ab47-4b2b-90a1-b9b2d0dc4591
Das, Sneha
0857b93b-9275-49c2-9c0d-c193fd15ffb3
Williams, Jennifer, Pizzi, Karla, Tomashenko, Natalia and Das, Sneha
(2024)
Anonymizing speaker voices: easy to imitate, difficult to recognize?
In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
IEEE.
.
(doi:10.1109/ICASSP48485.2024.10445935).
Record type:
Conference or Workshop Item
(Paper)
Abstract
A vastly under-explored area in speech anonymization involves characterizing how different speakers perform in voice privacy tasks. In this paper, we present a deeper analysis by creating and analyzing groups of challenging speakers categorized based on their performance in two related facets of voice anonymization evaluation: (1) speaker similarity using automatic speaker verification (ASV) and (2) human perception using a large-scale A/B listening test. We group speakers into four categories (sheep, goats, lambs, and wolves) based on their anonymization properties. We present an extension of voice anonymization evaluation by identifying speakers who are easy to imitate or difficult to recognize. This knowledge is important for trustworthy anonymization evaluation, and it has the potential to influence how evaluation datasets are created from a pool of speakers. We provide further insights on speaker influence on anonymized speech between human perception and automatic speaker similarity scoring.
This record has no associated files available for download.
More information
Published date: 18 March 2024
Venue - Dates:
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), , Seoul, Korea, Republic of, 2024-04-14
Identifiers
Local EPrints ID: 502698
URI: http://eprints.soton.ac.uk/id/eprint/502698
PURE UUID: 993c949a-90bc-4126-b6fc-5438f3e426a1
Catalogue record
Date deposited: 04 Jul 2025 16:47
Last modified: 05 Jul 2025 02:09
Export record
Altmetrics
Contributors
Author:
Jennifer Williams
Author:
Karla Pizzi
Author:
Natalia Tomashenko
Author:
Sneha Das
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics