2022
DOI: 10.1109/tvcg.2021.3067201
|View full text |Cite
|
Sign up to set email alerts
|

GAN-Based Multi-Style Photo Cartoonization

Abstract: Cartoon is a common form of art in our daily life and automatic generation of cartoon images from photos is highly desirable. However, state-of-the-art single-style methods can only generate one style of cartoon images from photos and existing multi-style image style transfer methods still struggle to produce high-quality cartoon images due to their highly simplified and abstract nature. In this paper, we propose a novel multi-style generative adversarial network (GAN) architecture, called MS-CartoonGAN, which… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 29 publications
(14 citation statements)
references
References 32 publications
0
3
0
Order By: Relevance
“…If the output image does not clearly reproduce the edges, but has the correct shading, it becomes difficult to train the discriminator. To circumvent this problem, in this paper, we use guided filtering for edge smoothing on the trained cartoon image [16], where general filters are isotropic filters, such as Gaussian filters, which smooth the image while erasing some of the edge frame details. Whereas guided filtering smooths the graph without blurring out the image edges and contours.…”
Section: Architecture Of the Discriminatormentioning
confidence: 99%
“…If the output image does not clearly reproduce the edges, but has the correct shading, it becomes difficult to train the discriminator. To circumvent this problem, in this paper, we use guided filtering for edge smoothing on the trained cartoon image [16], where general filters are isotropic filters, such as Gaussian filters, which smooth the image while erasing some of the edge frame details. Whereas guided filtering smooths the graph without blurring out the image edges and contours.…”
Section: Architecture Of the Discriminatormentioning
confidence: 99%
“…FID utilises Frechet distance between the two image distributions, and assumes that the two are Gaussian. This is applied to 2D assets [7], [64], [105], [106], [107], [108], [109], 3D assets via 3D classifiers [37], [110] or rasterisation to 2D form [111], [112].The Frechet point cloud distance extends FID for applications in assessing the similarity of pointbased 3D shapes [113]. This has been used to evaluate many deep-learning based 3D point-cloud [71], [113], [114] and mesh [67] generators.…”
Section: Perceptual Similarity Metricsmentioning
confidence: 99%
“…Synthetic data generation and augmentation using generative models might be a promising alternative. Generative adversarial networks (GAN) can produce high-quality images with a high level of realism [Karras et al 2019] with the added benefit of controlling the generated data by editing latent input vectors or conditioning the model on some input information [Shu et al 2021].…”
Section: Introductionmentioning
confidence: 99%