Audio-visual tracking by density approximation in a sequential Bayesian filtering framework
Audio-visual tracking by density approximation in a sequential Bayesian filtering framework
This paper proposes a novel audio-visual tracking approach that exploits constructively audio and visual modalities in order to estimate trajectories of multiple people in a joint state space. The tracking problem is modeled using a sequential Bayesian filtering framework. Within this framework, we propose to represent the posterior density with a Gaussian Mixture Model (GMM). To ensure that a GMM representation can be retained sequentially over time, the predictive density is approximated by a GMM using the Unscented Transform. While a density interpolation technique is introduced to obtain a continuous representation of the observation likelihood, which is also a GMM. Furthermore, to prevent the number of mixtures from growing exponentially over time, a density approximation based on the Expectation Maximization (EM) algorithm is applied, resulting in a compact GMM representation of the posterior density. Recordings using a camcorder and microphone array are used to evaluate the proposed approach, demonstrating significant improvements in tracking performance of the proposed audio-visual approach compared to two benchmark visual trackers.
Audio-visual systems, Bayes methods, Machine vision, Motion estimation, Speech processing
71-75
Gebru, Israel D.
de707c41-541c-4b5b-b4c6-d63486b18684
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b
Horaud, Radu
86f58721-f833-4200-a8ce-959babf9c522
10 April 2017
Gebru, Israel D.
de707c41-541c-4b5b-b4c6-d63486b18684
Evers, Christine
93090c84-e984-4cc3-9363-fbf3f3639c4b
Naylor, Patrick A.
13079486-664a-414c-a1a2-01a30bf0997b
Horaud, Radu
86f58721-f833-4200-a8ce-959babf9c522
Gebru, Israel D., Evers, Christine, Naylor, Patrick A. and Horaud, Radu
(2017)
Audio-visual tracking by density approximation in a sequential Bayesian filtering framework.
In 2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017 - Proceedings.
IEEE.
.
(doi:10.1109/HSCMA.2017.7895564).
Record type:
Conference or Workshop Item
(Paper)
Abstract
This paper proposes a novel audio-visual tracking approach that exploits constructively audio and visual modalities in order to estimate trajectories of multiple people in a joint state space. The tracking problem is modeled using a sequential Bayesian filtering framework. Within this framework, we propose to represent the posterior density with a Gaussian Mixture Model (GMM). To ensure that a GMM representation can be retained sequentially over time, the predictive density is approximated by a GMM using the Unscented Transform. While a density interpolation technique is introduced to obtain a continuous representation of the observation likelihood, which is also a GMM. Furthermore, to prevent the number of mixtures from growing exponentially over time, a density approximation based on the Expectation Maximization (EM) algorithm is applied, resulting in a compact GMM representation of the posterior density. Recordings using a camcorder and microphone array are used to evaluate the proposed approach, demonstrating significant improvements in tracking performance of the proposed audio-visual approach compared to two benchmark visual trackers.
This record has no associated files available for download.
More information
Published date: 10 April 2017
Venue - Dates:
2017 Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017, , San Francisco, United States, 2017-03-01 - 2017-03-03
Keywords:
Audio-visual systems, Bayes methods, Machine vision, Motion estimation, Speech processing
Identifiers
Local EPrints ID: 445098
URI: http://eprints.soton.ac.uk/id/eprint/445098
PURE UUID: 7872dfe0-1ca4-4209-9782-d89c9b0fba71
Catalogue record
Date deposited: 19 Nov 2020 17:32
Last modified: 17 Mar 2024 04:01
Export record
Altmetrics
Contributors
Author:
Israel D. Gebru
Author:
Christine Evers
Author:
Patrick A. Naylor
Author:
Radu Horaud
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics