2024
DOI: 10.1109/access.2024.3370158
|View full text |Cite
|
Sign up to set email alerts
|

CREAMY: Cross-Modal Recipe Retrieval By Avoiding Matching Imperfectly

Zhuoyang Zou,
Xinghui Zhu,
Qinying Zhu
et al.

Abstract: State-of-the-art methods for cross-modal recipe retrieval failed to consider an underlying but challenging issue, i.e., matching imperfectly problem hidden in positive image-recipe pairs, which is a culprit causing over-fitting. To make up this defect, two critical questions-how to effectively recognize and filter out mismatching parts during the model training and how to pick out and preserve as much matching information as possible need to be answered. To do so, this article proposes a novel method-Cross-mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 69 publications
0
2
0
Order By: Relevance
“…In line with prior research [ 49 ], we use food images with a depth of three channels in the RGB color space. All the images in our experiments are resized to 256 pixels in their shorter dimension and then cropped to pixels.…”
Section: Experiments and Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In line with prior research [ 49 ], we use food images with a depth of three channels in the RGB color space. All the images in our experiments are resized to 256 pixels in their shorter dimension and then cropped to pixels.…”
Section: Experiments and Discussionmentioning
confidence: 99%
“…An overview of our method MMACMR is depicted in Figure 2 . Following prevailing solutions [ 29 , 49 ], the backbone of MMACMR comprises an image encoder and a recipe encoder which project food images and recipes into a common feature subspace. In this subspace, the cross-modal features can be aligned effectively so that the similarity between images and recipes can be measured with accuracy.…”
Section: Methodsmentioning
confidence: 99%