Hybrid Euclidean-and-Riemannian Metric Learning for Image Set Classification

Huang, Zhiwu; Wang, Ruiping; Shan, Shiguang; Chen, Xilin

doi:10.1007/978-3-319-16811-1_37

Cited by 31 publications

(51 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In fact, this work is an extension of our previous work [33]. The differences between this work and the conference paper are as follows: (1) this paper extends the Single Gaussian Model (SGM) in the conference version to Gaussian Mixture Model (GMM), which is essentially a general version of SGM, for modeling the Gaussian distribution.…”

Section: Introductionmentioning

confidence: 92%

Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning

Huang

Wang

Shan

et al. 2015

Pattern Recognition

Self Cite

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 92%

Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning

Huang

Wang

Shan

et al. 2015

Pattern Recognition

Self Cite

View full text Add to dashboard Cite

“…to HERML-DeLF), which is basically the HERML method [13] for image set classification with image features learned by a deep neural network 4 For the feature learning part of our HERML-DeLF method, a deep convolutional neural network (DCNN) model is trained on 256 by 256 pixel face images. For a fair comparison, we normalize the face images using eye positions provided by the organizers of PaSC [2].…”

Section: A Chinese Academy Of Science (Cas)mentioning

confidence: 99%

“…Using the DCNN features, the HERML method [13] is then used to compute video similarity by fusing three different set-based video representations. Specifically, for each video, the DCNN features of all video frames are first pooled respectively by sample mean, sample covariance matrix and Gaussian model, which form three types of setbased video representations.…”

Section: A Chinese Academy Of Science (Cas)mentioning

confidence: 99%

“…Specifically, for each video, the DCNN features of all video frames are first pooled respectively by sample mean, sample covariance matrix and Gaussian model, which form three types of setbased video representations. Then, by applying the kernel functions proposed in [13] for set-based representations, three kernel matrices are computed and fed separately into kernel linear discriminant analysis (KLDA) [1]. Here, instead of the original metric fusing method in [13], we exploit the KLDA to learn three projective functions respectively [28].…”

Section: A Chinese Academy Of Science (Cas)mentioning

confidence: 99%

“…Then, by applying the kernel functions proposed in [13] for set-based representations, three kernel matrices are computed and fed separately into kernel linear discriminant analysis (KLDA) [1]. Here, instead of the original metric fusing method in [13], we exploit the KLDA to learn three projective functions respectively [28]. The resulting projective functions are then used to produce three 440 dimensional feature vectors for each video.…”

Section: A Chinese Academy Of Science (Cas)mentioning

confidence: 99%

See 2 more Smart Citations

Report on the FG 2015 Video Person Recognition Evaluation

Beveridge

Zhang

Draper

et al. 2015

2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)

View full text Add to dashboard Cite

show abstract

DreamNet: A Deep Riemannian Manifold Network for SPD Matrix Learning

Wang

Chen

et al. 2023

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Symmetric positive definite (SPD) matrix has been demonstrated to be an effective feature descriptor in many scientific areas, as it can encode spatiotemporal statistics of the data adequately on a curved Riemannian manifold, i.e., SPD manifold. Although there are many different ways to design network architectures for SPD matrix nonlinear learning, very few solutions explicitly mine the geometrical dependencies of features at different layers. Motivated by the great success of self-attention mechanism in capturing long-range relationships, an SPD manifold self-attention mechanism (SMSA) is proposed in this paper using some manifold-valued geometric operations, mainly the Riemannian metric, Riemannian mean, and Riemannian optimization. Then, an SMSA-based geometric learning module (SMSA-GLM) is designed for the sake of improving the discrimination of the generated deep structured representations. Extensive experimental results achieved on three benchmarking datasets show that our modification against the baseline network further alleviates the information degradation problem and leads to improved accuracy.

show abstract

Hybrid Euclidean-and-Riemannian Metric Learning for Image Set Classification

Cited by 31 publications

References 27 publications

Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning

Face recognition on large-scale video in the wild with hybrid Euclidean-and-Riemannian metric learning

Report on the FG 2015 Video Person Recognition Evaluation

DreamNet: A Deep Riemannian Manifold Network for SPD Matrix Learning

Contact Info

Product

Resources

About