Distance Metric Learning Using Privileged Information for Face Verification and Person Re-Identification

Xu, Xinxing; Wen, Li; Xu, Dong

doi:10.1109/tnnls.2015.2405574

Cited by 93 publications

(45 citation statements)

References 33 publications

“…After that, many variants of SVM+ have been proposed for solving different tasks [24,12,34,29,33,23]. In [24], Liang and Cherkassky developed a multi-task learning approach based on SVM+.…”

Section: Related Work (mentioning)

confidence: 99%

“…In [17], a multi-task multi-class extension of SVM+ was proposed. Fouad et al [12] designed a two-step approach for metric learning, and Xu et al [34] formulated a convex formulation for metric learning using privileged information based on the information theory metric learning (ITML) method. Sharmanska et al [29] proposed the Rank Transfer method for utilizing privileged information, and demonstrated the effectiveness of privileged information in various computer vision tasks.…”

Section: Related Work (mentioning)

confidence: 99%

Fast Algorithms for Linear and Kernel SVM+

Dai, Tan, et al. 2016

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Self Cite

“…In contrast to the above supervised methods, side information, which can be collected in an unsupervised manner and indicates that certain examples belong to the same class, can be used by the relevance component analysis (RCA) method to learn a Mahalanobis metric [5]. Additional information, such as depth information, can be used with a modified version of information theoretic metric learning [11] to improve re-identification performance [59]. The fact that people move through camera networks has been used to learn multiple related Mahalanobis distance metrics between camera pairs [42], however, this approach required knowledge of the camera network layout and different training for each camera pair.…”

Section: B Metric Learning (mentioning)

confidence: 99%

Person Reidentification Using Deep Convnets With Multitask Learning

McLaughlin, Rincón, Miller 2017

IEEE Trans. Circuits Syst. Video Technol.

Abstract: Person re-identification involves recognizing a person across non-overlapping camera views, with different pose, illumination, and camera characteristics. We propose to tackle this problem by training a deep convolutional network to represent a person's appearance as a low-dimensional feature vector that is invariant to common appearance variations encountered in the re-identification problem. Specifically, a Siamese-network architecture is used to train a feature extraction network using pairs of similar and dissimilar images. We show that the use of a novel multi-task learning objective is crucial for regularizing the network parameters in order to prevent over-fitting due to the small size of the training dataset. We complement the verification task, which is at the heart of re-identification, by training the network to jointly perform verification and identification, and to recognize attributes related to the clothing and pose of the person in each image. Additionally, we show that our proposed approach performs well even in the challenging cross-dataset scenario, which may better reflect real-world expected performance.

“…In the LUPI paradigm, the training samples are associated with additional features that are not available for the testing data, which are referred to as PI. In some recent works [9], [39]-[41], PI was exploited for different computer vision tasks. In [39], a rank SVM method was proposed to rank Web images based on PI.…”

Section: Related Work (mentioning)

confidence: 99%

“…In [39], a rank SVM method was proposed to rank Web images based on PI. In [40] and [41], PI was incorporated into distance metric learning. However, these works assume that the training data and the testing data are with the same data distribution, while this assumption does not hold in our setting.…”

Section: Related Work (mentioning)

confidence: 99%

Visual Recognition by Learning From Web Data via Weakly Supervised Domain Generalization

Niu et al. 2017

IEEE Trans. Neural Netw. Learning Syst.

Self Cite

In this paper, a weakly supervised domain generalization (WSDG) method is proposed for real-world visual recognition tasks, in which we train classifiers by using Web data (e.g., Web images and Web videos) with noisy labels. In particular, two challenging problems need to be solved when learning robust classifiers, in which the first issue is to cope with the label noise of training Web data from the source domain, while the second issue is to enhance the generalization capability of learned classifiers to an arbitrary target domain. In order to handle the first problem, the training samples within each category are partitioned into clusters, where we use one bag to denote each cluster and instances to denote the samples in each cluster. Then, we identify a proportion of good training samples in each bag and train robust classifiers by using the good training samples, which leads to a multi-instance learning (MIL) problem. In order to handle the second problem, we assume that the training samples possibly form a set of hidden domains, with each hidden domain associated with a distinctive data distribution. Then, for each category and each hidden latent domain, we propose to learn one classifier by extending our MIL formulation, which leads to our WSDG approach. In the testing stage, our approach can obtain better generalization capability by effectively integrating multiple classifiers from different latent domains in each category. Moreover, our WSDG approach is further extended to utilize additional textual descriptions associated with Web data as privileged information (PI), although testing data do not have such PI. Extensive experiments on three benchmark data sets indicate that our newly proposed methods are effective for real-world visual recognition tasks by learning from Web data.
