Speaker tracking in reverberant environments using multiple directions of arrival
Speaker tracking in reverberant environments using multiple directions of arrival
Accurate estimation of the Direction of Arrival (DOA) of a sound source is an important prerequisite for a wide range of acoustic signal processing applications. However, in enclosed environments, early reflections and late reverberation often lead to localization errors. Recent work demonstrated that improved robustness against reverberation can be achieved by clustering only the DOAs from direct-path bins in the short-term Fourier transform of a speech signal of several seconds duration from a static talker. Nevertheless, for moving talkers, short blocks of at most several hundred milliseconds are required to capture the spatio-temporal variation of the source direction. Processing of short blocks of data in reverberant environment can lead to clusters whose centroids correspond to spurious DOAs away from the source direction. We therefore propose in this paper a novel multi-detection source tracking approach that estimates the smoothed trajectory of the source DOAs. Results for realistic room simulations validate the proposed approach and demonstrate significant improvements in estimation accuracy compared to single-detection tracking.
Bayes methods, Direction-of-arrival estimation, Motion estimation, Speech processing
91-95
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Rafaely, Boaz
0839a7e8-bdbc-4f46-b57b-a9e5c625e968
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b
10 April 2017
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Rafaely, Boaz
0839a7e8-bdbc-4f46-b57b-a9e5c625e968
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b
Evers, Christine, Rafaely, Boaz and Naylor, Patrick A.
(2017)
Speaker tracking in reverberant environments using multiple directions of arrival.
In 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings.
IEEE.
.
(doi:10.1109/HSCMA.2017.7895568).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Accurate estimation of the Direction of Arrival (DOA) of a sound source is an important prerequisite for a wide range of acoustic signal processing applications. However, in enclosed environments, early reflections and late reverberation often lead to localization errors. Recent work demonstrated that improved robustness against reverberation can be achieved by clustering only the DOAs from direct-path bins in the short-term Fourier transform of a speech signal of several seconds duration from a static talker. Nevertheless, for moving talkers, short blocks of at most several hundred milliseconds are required to capture the spatio-temporal variation of the source direction. Processing of short blocks of data in reverberant environment can lead to clusters whose centroids correspond to spurious DOAs away from the source direction. We therefore propose in this paper a novel multi-detection source tracking approach that estimates the smoothed trajectory of the source DOAs. Results for realistic room simulations validate the proposed approach and demonstrate significant improvements in estimation accuracy compared to single-detection tracking.
This record has no associated files available for download.
More information
Published date: 10 April 2017
Venue - Dates:
2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017, , San Francisco, United States, 2017-03-01 - 2017-03-03
Keywords:
Bayes methods, Direction-of-arrival estimation, Motion estimation, Speech processing
Identifiers
Local EPrints ID: 445092
URI: http://eprints.soton.ac.uk/id/eprint/445092
PURE UUID: 4268d506-1070-4062-a293-ec35966ec241
Catalogue record
Date deposited: 19 Nov 2020 17:32
Last modified: 17 Mar 2024 04:01
Export record
Altmetrics
Contributors
Author:
Christine Evers
Author:
Boaz Rafaely
Author:
Patrick A. Naylor
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics