SE-GAN: Skeleton Enhanced GAN-based Model for Brush Handwriting Font Generation

Yuan, Shuai; Liu, Ruixue; Chen, Meng; Chen, Baoyang; Qiu, Zhijie; He, Xiaodong

doi:10.48550/arxiv.2204.10484

Cited by 3 publications

(8 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As certain a type of artificial images, the components of Chinese characters such as strokes, radicals and skeletons are closely related to the font styles and structures of Chinese characters. Thus, incorporating these components into the generation of Chinese fonts has attracted an amount of attention in the past decade [10], [14], [21], [22], [23], [24], [25], [26], [27]. In the early stage, the Chinese font generation models are mainly based on the handcrafted explicit features such as strokes and radicals [21], [22], [23].…”

Section: Related Workmentioning

confidence: 99%

“…With the development of deep learning, some components of Chinese characters such as strokes, radicals and skeletons have been usually extracted by some deep neural networks and incorporated into the GAN model as certain important supervision information [10], [14], [24], [25], [26], [27]. In [24], the authors first divided Chinese characters into strokes by adopting certain a coherent point drift algorithm and then generated new font strokes by fusing the styles of two existing font strokes and further yielded new fonts by assembling them.…”

Section: Related Workmentioning

confidence: 99%

“…With the emergence of the generative adversarial network (GAN) [6], the kind of models based on GAN become the mainstream for the generation of Chinese fonts. These models can be generally divided into two categories, i.e., the class of models based on the paired data (that is, there is a one-to-one correspondence between the source and target font domains as shown in Figure 2(c)) [7], [8], [9], [10], [11], and the class of models based on the unpaired data (that is, such a one-to-one correspondence is not required as shown in Figure 2(d)) [12], [13], [14], [15], [16].…”

Section: Introductionmentioning

confidence: 99%

“…In [9], the authors proposed an effective model called CalliGAN by incorporating some kinds of component information such as strokes of Chinese characters, where several auxiliary network modules were introduced to extract these kinds of component information. In the recent paper [10], the authors proposed a novel GAN-based image translation model by integrating the skeleton information for the generation of Chinese brush handwriting font. In [11], the authors proposed a font generation method that learns localized styles, namely component-wise style representations.…”

Section: Introductionmentioning

confidence: 99%

“…In [11], the authors proposed a font generation method that learns localized styles, namely component-wise style representations. Although the performance of this kind of model based on paired data in the literature [7], [8], [9], [10], [11] is impressive, the collection of extensive paired samples is generally labour-intensive and expensive. Particularly, in some font generation tasks such as the generation of ancient calligraphy fonts [4], [5], it is hard to yield extensive paired samples.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

Zeng

Chen

Liu

et al. 2021

AAAI

View full text Add to dashboard Cite

The generation of stylish Chinese fonts is an important problem involved in many applications. Most of existing generation methods are based on the deep generative models, particularly, the generative adversarial networks (GAN) based models. However, these deep generative models may suffer from the mode collapse issue, which significantly degrades the diversity and quality of generated results. In this paper, we introduce a one-bit stroke encoding to capture the key mode information of Chinese characters and then incorporate it into CycleGAN, a popular deep generative model for Chinese font generation. As a result we propose an efficient method called StrokeGAN, mainly motivated by the observation that the stroke encoding contains amount of mode information of Chinese characters. In order to reconstruct the one-bit stroke encoding of the associated generated characters, we introduce a stroke-encoding reconstruction loss imposed on the discriminator. Equipped with such one-bit stroke encoding and stroke-encoding reconstruction loss, the mode collapse issue of CycleGAN can be significantly alleviated, with an improved preservation of strokes and diversity of generated characters. The effectiveness of StrokeGAN is demonstrated by a series of generation tasks over nine datasets with different fonts. The numerical results demonstrate that StrokeGAN generally outperforms the state-of-the-art methods in terms of content and recognition accuracies, as well as certain stroke error, and also generates more realistic characters.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

Zeng

Chen

Liu

et al. 2021

AAAI

View full text Add to dashboard Cite

show abstract

Detecting Tool Keypoints with Synthetic Training Data

Vanherle

Put

Michiels

et al. 2022

Communications in Computer and Information Science

View full text Add to dashboard Cite

Styled Handwritten Text Generation (HTG) has received significant attention in recent years, propelled by the success of learning-based solutions employing GANs, Transformers, and, preliminarily, Diffusion Models. Despite this surge in interest, there remains a critical yet understudied aspect -the impact of the input, both visual and textual, on the HTG model training and its subsequent influence on performance. This study delves deeper into a cutting-edge Styled-HTG approach, proposing strategies for input preparation and training regularization that allow the model to achieve better performance and generalize better. These aspects are validated through extensive analysis on several different settings and datasets. Moreover, in this work, we go beyond performance optimization and address a significant hurdle in HTG research -the lack of a standardized evaluation protocol. In particular, we propose a standardization of the evaluation protocol for HTG and conduct a comprehensive benchmarking of existing approaches. By doing so, we aim to establish a foundation for fair and meaningful comparisons between HTG strategies, fostering progress in the field.

show abstract

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Liu,

Lian

2024

AAAI

View full text Add to dashboard Cite

Few-shot font generation, especially for Chinese calligraphy fonts, is a challenging and ongoing problem. With the help of prior knowledge that is mainly based on glyph consistency assumptions, some recently proposed methods can synthesize high-quality Chinese glyph images. However, glyphs in calligraphy font styles often do not meet these assumptions. To address this problem, we propose a novel model, DeepCalliFont, for few-shot Chinese calligraphy font synthesis by integrating dual-modality generative models. Specifically, the proposed model consists of image synthesis and sequence generation branches, generating consistent results via a dual-modality representation learning strategy. The two modalities (i.e., glyph images and writing sequences) are properly integrated using a feature recombination module and a rasterization loss function. Furthermore, a new pre-training strategy is adopted to improve the performance by exploiting large amounts of uni-modality data. Both qualitative and quantitative experiments have been conducted to demonstrate the superiority of our method to other state-of-the-art approaches in the task of few-shot Chinese calligraphy font synthesis. The source code can be found at https://github.com/lsflyt-pku/DeepCalliFont.

show abstract

SE-GAN: Skeleton Enhanced GAN-based Model for Brush Handwriting Font Generation

Cited by 3 publications

References 20 publications

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

Detecting Tool Keypoints with Synthetic Training Data

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Contact Info

Product

Resources

About