2018
DOI: 10.1109/tmm.2018.2796248
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild

Abstract: In order to retrieve unlabeled images by textual queries, cross-media similarity computation is a key ingredient. Although novel methods are continuously introduced, little has been done to evaluate these methods together with large-scale query log analysis. Consequently, how far have these methods brought us in answering real-user queries is unclear. Given baseline methods that use relatively simple text/image matching, how much progress have advanced models made is also unclear. This paper takes a pragmatic … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 19 publications
(2 citation statements)
references
References 62 publications
0
2
0
Order By: Relevance
“…Metrics For the task of attribute-specific fashion retrieval, we report the Mean Average Precision (MAP), a popular performance metric in many retrieval-related tasks (Awad et al 2018;Dong, Li, and Xu 2018). For the triplet relation prediction task, we utilize the prediction accuracy as the metric.…”
Section: Evaluation Experimental Setupmentioning
confidence: 99%
“…Metrics For the task of attribute-specific fashion retrieval, we report the Mean Average Precision (MAP), a popular performance metric in many retrieval-related tasks (Awad et al 2018;Dong, Li, and Xu 2018). For the triplet relation prediction task, we utilize the prediction accuracy as the metric.…”
Section: Evaluation Experimental Setupmentioning
confidence: 99%
“…Image retrieval is an important research topic in the field of computer vision and multimedia, and has a wide range of applications [1,2], such as fashion retrieval [3,4] and person re-identification [5,6]. The retrieval system usually only uses a picture or a paragraph of text as input.…”
Section: Introductionmentioning
confidence: 99%