The University of Southampton
University of Southampton Institutional Repository

Speaker tracking in reverberant environments using multiple directions of arrival

Speaker tracking in reverberant environments using multiple directions of arrival
Speaker tracking in reverberant environments using multiple directions of arrival

Accurate estimation of the Direction of Arrival (DOA) of a sound source is an important prerequisite for a wide range of acoustic signal processing applications. However, in enclosed environments, early reflections and late reverberation often lead to localization errors. Recent work demonstrated that improved robustness against reverberation can be achieved by clustering only the DOAs from direct-path bins in the short-term Fourier transform of a speech signal of several seconds duration from a static talker. Nevertheless, for moving talkers, short blocks of at most several hundred milliseconds are required to capture the spatio-temporal variation of the source direction. Processing of short blocks of data in reverberant environment can lead to clusters whose centroids correspond to spurious DOAs away from the source direction. We therefore propose in this paper a novel multi-detection source tracking approach that estimates the smoothed trajectory of the source DOAs. Results for realistic room simulations validate the proposed approach and demonstrate significant improvements in estimation accuracy compared to single-detection tracking.

Bayes methods, Direction-of-arrival estimation, Motion estimation, Speech processing
91-95
IEEE
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Rafaely, Boaz
0839a7e8-bdbc-4f46-b57b-a9e5c625e968
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Rafaely, Boaz
0839a7e8-bdbc-4f46-b57b-a9e5c625e968
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b

Evers, Christine, Rafaely, Boaz and Naylor, Patrick A. (2017) Speaker tracking in reverberant environments using multiple directions of arrival. In 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings. IEEE. pp. 91-95 . (doi:10.1109/HSCMA.2017.7895568).

Record type: Conference or Workshop Item (Paper)

Abstract

Accurate estimation of the Direction of Arrival (DOA) of a sound source is an important prerequisite for a wide range of acoustic signal processing applications. However, in enclosed environments, early reflections and late reverberation often lead to localization errors. Recent work demonstrated that improved robustness against reverberation can be achieved by clustering only the DOAs from direct-path bins in the short-term Fourier transform of a speech signal of several seconds duration from a static talker. Nevertheless, for moving talkers, short blocks of at most several hundred milliseconds are required to capture the spatio-temporal variation of the source direction. Processing of short blocks of data in reverberant environment can lead to clusters whose centroids correspond to spurious DOAs away from the source direction. We therefore propose in this paper a novel multi-detection source tracking approach that estimates the smoothed trajectory of the source DOAs. Results for realistic room simulations validate the proposed approach and demonstrate significant improvements in estimation accuracy compared to single-detection tracking.

This record has no associated files available for download.

More information

Published date: 10 April 2017
Venue - Dates: 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017, , San Francisco, United States, 2017-03-01 - 2017-03-03
Keywords: Bayes methods, Direction-of-arrival estimation, Motion estimation, Speech processing

Identifiers

Local EPrints ID: 445092
URI: http://eprints.soton.ac.uk/id/eprint/445092
PURE UUID: 4268d506-1070-4062-a293-ec35966ec241
ORCID for Christine Evers: ORCID iD orcid.org/0000-0003-0757-5504

Catalogue record

Date deposited: 19 Nov 2020 17:32
Last modified: 17 Mar 2024 04:01

Export record

Altmetrics

Contributors

Author: Christine Evers ORCID iD
Author: Boaz Rafaely
Author: Patrick A. Naylor

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×