Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413636

ChefGAN

Abstract: Although significant progress has been made in generating images from text with generative adversarial networks (GANs), it remains challenging to handle long text that contains complex semantic information, such as recipes. This paper focuses on generating images with high visual realism and semantic consistency from the complex text of recipes. To achieve this, we propose a GAN-based method termed ChefGAN. The critical concept of ChefGAN is that a joint image-recipe embedding model is used before t…

Cited by 19 publications
(2 citation statements)
References 20 publications
“…Complex images have high visual realism and semantic consistency; the bandwidth resolution is high (design complexity) and consists of various degrees of elements in the layout (feature complexity) intended to evoke the processing of the cues presented. Simple images with low bandwidth resolution without multiple components are designed to dissuade the processing capabilities of the cue given (Da Silva et al, 2011;Pan et al, 2020;Yu & Winkler, 2013). number of elements present in any given layout, and design complexity accounts for the detailed variation in the layout that conveys the primary visual form, such as color, shape, brightness, and edge patterns of a layout.…”
Section: Methods (mentioning)
confidence: 99%
“…Wang et al [62] introduced a cycle-consistency training method, which improved image generation by optimizing the inverted latent codes. Chef-GAN [63] involved a joint image-recipe embedding model to GANs before and during the stage of generate images. CookGAN [64] mimicked visual effect of instructions and preserved the fine-grained details of images.…”
Section: B Food Image/recipe Generation (mentioning)
confidence: 99%