2022
DOI: 10.48550/arxiv.2203.08101
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

Abstract: An intuitive way to search for images is to use queries composed of an example image and a complementary text. While the first provides rich and implicit context for the search, the latter explicitly calls for new traits, or specifies how some elements of the example image should be changed to retrieve the desired target image. Current approaches typically combine the features of each of the two elements of the query into a single representation, which can then be compared to the ones of the potential target i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 32 publications
0
3
0
Order By: Relevance
“…Metrics. Following previous works (Baldrati et al 2022;Delmas et al 2022;Zhao, Song, and Jin 2022), we employ Recall within top-K as the retrieval performance, which indicates the ratio of the ground-truth target image in the top-K ranking list that is correctly retrieved.…”
Section: Methodsmentioning
confidence: 99%
“…Metrics. Following previous works (Baldrati et al 2022;Delmas et al 2022;Zhao, Song, and Jin 2022), we employ Recall within top-K as the retrieval performance, which indicates the ratio of the ground-truth target image in the top-K ranking list that is correctly retrieved.…”
Section: Methodsmentioning
confidence: 99%
“…However, interactive recommendation in design scenarios depend on the user's intuitive visual experience. Therefore, visually-grounded dialog system researches have also emerged in e-commerce platforms in recent years [2,10,39,40]. For example, Yuan et al [39] proposes a conversational fashion image retrieval method that predicts the desired image based on text and image information in the conversation history.…”
Section: Interactive Recommendationmentioning
confidence: 99%
“…The interior design data used in this paper were gathered from a popular Chinese interior design website 2 . The data comprises (The open design of entire dining room weakens the TV-centered layout and focuses on comfort to create a comfortable living atmosphere.…”
Section: Data Descriptionmentioning
confidence: 99%