ChefGAN

Pan, Si-Yuan; Dai, Ling; Hou, Xuhong; Li, Huating; Sheng, Bin

doi:10.1145/3394171.3413636

Cited by 19 publications

(2 citation statements)

References 20 publications

“…Complex images have high visual realism and semantic consistency; the bandwidth resolution is high (design complexity) and consists of various degrees of elements in the layout (feature complexity) intended to evoke the processing of the cues presented. Simple images with low bandwidth resolution without multiple components are designed to dissuade the processing capabilities of the cue given (Da Silva et al., 2011; Pan et al., 2020; Yu & Winkler, 2013). number of elements present in any given layout, and design complexity accounts for the detailed variation in the layout that conveys the primary visual form, such as color, shape, brightness, and edge patterns of a layout.…”

Section: Methods (mentioning)

confidence: 99%

Design Complexity: Assessing Cross-Modal Correspondence between Complex Food Images and the Desire to Eat.

2024

IJMCNM

“…Wang et al [62] introduced a cycle-consistency training method, which improved image generation by optimizing the inverted latent codes. Chef-GAN [63] involved a joint image-recipe embedding model to GANs before and during the stage of generate images. CookGAN [64] mimicked visual effect of instructions and preserved the fine-grained details of images.…”

Section: B Food Image/recipe Generation (mentioning)

confidence: 99%

CREAMY: Cross-Modal Recipe Retrieval By Avoiding Matching Imperfectly

Zou, Zhu, Zhu et al. 2024

IEEE Access

State-of-the-art methods for cross-modal recipe retrieval fail to consider an underlying but challenging issue: the imperfect-matching problem hidden in positive image-recipe pairs, which is one cause of over-fitting. Addressing it requires answering two critical questions: how to effectively recognize and filter out mismatching parts during model training, and how to pick out and preserve as much matching information as possible. To this end, this article proposes a novel method, Cross-modal Recipe rEtrieval by Avoiding Matching imperfectlY (CREAMY), which involves a newly designed learning strategy called Non-Matching and Partial-Matching (NMPM) that undertakes two tasks: (1) no longer forcibly aligning each positive image-recipe pair, but rather capturing the complementary information from negative pairs; (2) carefully picking out and aligning the matchable part in each pair. To the best of our knowledge, this is the first attempt to address the imperfect-matching issue in the cross-modal recipe retrieval task. Empirical analysis conducted on the Recipe1M dataset validates the advantages of CREAMY over several state-of-the-art methods. The code is available at: https://github.com/users/pouqual/CREAMY.
