Kernel-based distance metric learning for content-based image retrieval

Chang, Hong; Yeung, Dit–Yan

doi:10.1016/j.imavis.2006.05.013

Cited by 52 publications

(28 citation statements)

References 22 publications

(34 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As another direction, we will incorporate dissimilarity constraints into the methods to further improve the metric learning performance. Moreover, we will explore the application of the proposed methods to other real-world problems such as content-based image retrieval [7], [8], [9].…”

Section: Discussionmentioning

confidence: 99%

A Kernel Approach for Semisupervised Metric Learning

Yeung

Chang

2007

IEEE Trans. Neural Netw.

View full text Add to dashboard Cite

Abstract-While distance function learning for supervised learning tasks has a long history, extending it to learning tasks with weaker supervisory information has only been studied recently. In particular, some methods have been proposed for semi-supervised metric learning based on pairwise similarity or dissimilarity information. In this paper, we propose a kernel approach for semi-supervised metric learning and present in detail two special cases of this kernel approach. The metric learning problem is thus formulated as an optimization problem for kernel learning. An attractive property of the optimization problem is that it is convex and hence has no local optima. While a closed-form solution exists for the first special case, the second case is solved using an iterative majorization procedure to estimate the optimal solution asymptotically. Experimental results based on both synthetic and real-world data show that this new kernel approach is promising for nonlinear metric learning.

show abstract

Section: Discussionmentioning

confidence: 99%

A Kernel Approach for Semisupervised Metric Learning

Yeung

Chang

2007

IEEE Trans. Neural Netw.

View full text Add to dashboard Cite

show abstract

“…DML methods can be used to find a linear transformation that projects the image features to a new meaningful feature space to reduce this semantic gap. Previous work showed that appropriately designed distance metrics could improve CBIR performance compared with Euclidean distance [27]. For the BoW model, the semantic meaning of visual words is ambiguous; thus, the retrieval performance can be improved by embedding the semantic information to BoW representation by supervised DML.…”

Section: Bag-of-visual-words Representation Of Lesionsmentioning

confidence: 99%

Content-Based Retrieval of Focal Liver Lesions Using Bag-of-Visual-Words Representations of Single- and Multiphase Contrast-Enhanced CT Images

Yang

et al. 2012

J Digit Imaging

View full text Add to dashboard Cite

This paper is aimed at developing and evaluating a content-based retrieval method for contrastenhanced liver computed tomographic (CT) images using bag-of-visual-words (BoW) representations of single and multiple phases. The BoW histograms are extracted using the raw intensity as local patch descriptor for each enhance phase by densely sampling the image patches within the liver lesion regions. The distance metric learning algorithms are employed to obtain the semantic similarity on the Hellinger kernel feature map of the BoW histograms. The different visual vocabularies for BoW and learned distance metrics are evaluated in a contrast-enhanced CT image dataset comprised of 189 patients with three types of focal liver lesions, including 87 hepatomas, 62 cysts, and 60 hemangiomas. For each single enhance phase, the mean of average precision (mAP) of BoW representations for retrieval can reach above 90 % which is significantly higher than that of intensity histogram and Gabor filters. Furthermore, the combined BoW representations of the three enhance phases can improve mAP to 94.5 %. These preliminary results demonstrate that the BoW representation is effective and feasible for retrieval of liver lesions in contrast-enhanced CT images.

show abstract

“…Multiple regression analysis is widely used since it can weight each feature, but "distance metric learning [14]" (DML) is more effective since it can take side information into account. Many studies on DML have demonstrated its usefulness in applications such as image retrieval [15], music retrieval [16], and sentence retrieval [17]. This technique can realize speaker selection if the side information is set properly.…”

Section: Introductionmentioning

confidence: 99%

“…In addition, DML has also been used for feature space transformation in a number of studies. For instance, [15] used transformation of the original image space for image retrieval. In this paper, since the perceptual voice quality similarity is used as the side information, DML can be considered to be transformation from acoustic feature space to perceptual voice quality similarity space.…”

Section: Introductionmentioning

confidence: 99%

Similar Speaker Selection Technique Based on Distance Metric Learning Using Highly Correlated Acoustic Features with Perceptual Voice Quality Similarity

Ijima

Mizuno

2015

IEICE Trans. Inf. ^|^ Syst.

View full text Add to dashboard Cite

SUMMARYThis paper analyzes the correlation between various acoustic features and perceptual voice quality similarity, and proposes a perceptually similar speaker selection technique based on distance metric learning. To analyze the relationship between acoustic features and voice quality similarity, we first conduct a large-scale subjective experiment using the voices of 62 female speakers and perceptual voice quality similarity scores between all pairs of speakers are acquired. Next, multiple linear regression analysis is carried out; it shows that four acoustic features are highly correlated to voice quality similarity. The proposed speaker selection technique first trains a transform matrix based on distance metric learning using the perceptual voice quality similarity acquired in the subjective experiment. Given an input speech, acoustic features of the input speech are transformed using the trained transform matrix, after which speaker selection is performed based on the Euclidean distance on the transformed acoustic feature space. We perform speaker selection experiments and evaluate the performance of the proposed technique by comparing it to speaker selection without feature space transformation. The results indicate that transformation based on distance metric learning reduces the error rate by 53.9%.

show abstract

Kernel-based distance metric learning for content-based image retrieval

Cited by 52 publications

References 22 publications

A Kernel Approach for Semisupervised Metric Learning

A Kernel Approach for Semisupervised Metric Learning

Content-Based Retrieval of Focal Liver Lesions Using Bag-of-Visual-Words Representations of Single- and Multiphase Contrast-Enhanced CT Images

Similar Speaker Selection Technique Based on Distance Metric Learning Using Highly Correlated Acoustic Features with Perceptual Voice Quality Similarity

Contact Info

Product

Resources

About