The University of Southampton
University of Southampton Institutional Repository

Cross euclidean-to-riemannian metric learning with application to face recognition from video

Cross euclidean-to-riemannian metric learning with application to face recognition from video
Cross euclidean-to-riemannian metric learning with application to face recognition from video
Riemannian manifolds have been widely employed for video representations in visual classification tasks including video-based face recognition. The success mainly derives from learning a discriminant Riemannian metric which encodes the non-linear geometry of the underlying Riemannian manifolds. In this paper, we propose a novel metric learning framework to learn a distance metric across a Euclidean space and a Riemannian manifold to fuse average appearance and pattern variation of faces within one video. The proposed metric learning framework can handle three typical tasks of video-based face recognition: Video-to-Still, Still-to-Video and Video-to-Video settings. To accomplish this new framework, by exploiting typical Riemannian geometries for kernel embedding, we map the source Euclidean space and Riemannian manifold into a common Euclidean subspace, each through a corresponding high-dimensional Reproducing Kernel Hilbert Space (RKHS). With this mapping, the problem of learning a cross-view metric between the two source heterogeneous spaces can be converted to learning a single-view Euclidean distance metric in the target common Euclidean space. By learning information on heterogeneous data with the shared label, the discriminant metric in the common space improves face recognition from videos. Extensive experiments on four challenging video face databases demonstrate that the proposed framework has a clear advantage over the state-of-the-art methods in the three classical video-based face recognition scenarios.
1939-3539
2827 - 2840
Huang, Zhiwu
84f477cd-9097-44dd-a33e-ff71f253d36b
Wang, Ruiping
94a608c0-fd66-47c4-9bea-5e708128eb86
Shan, Shiguang
78e49abb-f490-480f-b534-0b7e05c9cbe4
Van Gool, Luc
7aa6fbb4-68f5-4b18-8d99-ba71be78844d
Chen, Xilin
48380269-4169-4310-ae77-30dade3b551b
Huang, Zhiwu
84f477cd-9097-44dd-a33e-ff71f253d36b
Wang, Ruiping
94a608c0-fd66-47c4-9bea-5e708128eb86
Shan, Shiguang
78e49abb-f490-480f-b534-0b7e05c9cbe4
Van Gool, Luc
7aa6fbb4-68f5-4b18-8d99-ba71be78844d
Chen, Xilin
48380269-4169-4310-ae77-30dade3b551b

Huang, Zhiwu, Wang, Ruiping, Shan, Shiguang, Van Gool, Luc and Chen, Xilin (2017) Cross euclidean-to-riemannian metric learning with application to face recognition from video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (12), 2827 - 2840. (doi:10.1109/TPAMI.2017.2776154).

Record type: Article

Abstract

Riemannian manifolds have been widely employed for video representations in visual classification tasks including video-based face recognition. The success mainly derives from learning a discriminant Riemannian metric which encodes the non-linear geometry of the underlying Riemannian manifolds. In this paper, we propose a novel metric learning framework to learn a distance metric across a Euclidean space and a Riemannian manifold to fuse average appearance and pattern variation of faces within one video. The proposed metric learning framework can handle three typical tasks of video-based face recognition: Video-to-Still, Still-to-Video and Video-to-Video settings. To accomplish this new framework, by exploiting typical Riemannian geometries for kernel embedding, we map the source Euclidean space and Riemannian manifold into a common Euclidean subspace, each through a corresponding high-dimensional Reproducing Kernel Hilbert Space (RKHS). With this mapping, the problem of learning a cross-view metric between the two source heterogeneous spaces can be converted to learning a single-view Euclidean distance metric in the target common Euclidean space. By learning information on heterogeneous data with the shared label, the discriminant metric in the common space improves face recognition from videos. Extensive experiments on four challenging video face databases demonstrate that the proposed framework has a clear advantage over the state-of-the-art methods in the three classical video-based face recognition scenarios.

This record has no associated files available for download.

More information

Published date: 22 November 2017

Identifiers

Local EPrints ID: 501110
URI: http://eprints.soton.ac.uk/id/eprint/501110
ISSN: 1939-3539
PURE UUID: 65710970-5a46-46f5-a49c-24eaeae550ca
ORCID for Zhiwu Huang: ORCID iD orcid.org/0000-0002-7385-079X

Catalogue record

Date deposited: 23 May 2025 17:15
Last modified: 25 May 2025 05:21

Export record

Altmetrics

Contributors

Author: Zhiwu Huang ORCID iD
Author: Ruiping Wang
Author: Shiguang Shan
Author: Luc Van Gool
Author: Xilin Chen

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×