2020
DOI: 10.1007/s11432-019-2737-0
|View full text |Cite
|
Sign up to set email alerts
|

SynthText3D: synthesizing scene text images from 3D virtual worlds

Abstract: With the development of deep neural networks, the demand for a significant amount of annotated training data becomes the performance bottlenecks in many fields of research and applications. Image synthesis can generate annotated images automatically and freely, which gains increasing attention recently. In this paper, we propose to synthesize scene text images from the 3D virtual worlds, where the precise descriptions of scenes, editable illumination/visibility, and realistic physics are provided. Different fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 40 publications
(18 citation statements)
references
References 44 publications
0
15
0
Order By: Relevance
“…Our method aims at use cases involving creative self expression and augmented reality (e.g., photo-realistic translation, leveraging multi-lingual OCR technologies [59]). Our method can be used for data generation and augmentation for training future OCR systems, as successfully done by others [49], [60] and in other domains [61], [62]. We are aware, however, that like other technologies, ours can be misused, possibly the same as deepfake faces can be used for misinformation.…”
Section: Discussionmentioning
confidence: 87%
“…Our method aims at use cases involving creative self expression and augmented reality (e.g., photo-realistic translation, leveraging multi-lingual OCR technologies [59]). Our method can be used for data generation and augmentation for training future OCR systems, as successfully done by others [49], [60] and in other domains [61], [62]. We are aware, however, that like other technologies, ours can be misused, possibly the same as deepfake faces can be used for misinformation.…”
Section: Discussionmentioning
confidence: 87%
“…In STR, it has become a standard practice to use synthetic datasets. We introduce previous studies providing synthetic data generation algorithms for STR [5] and other tasks that can be exploited for STR [2,8,12].…”
Section: Related Workmentioning
confidence: 99%
“…However, the use of off-the-shelf segmentation techniques for text-background alignment can produce erroneous predictions and result in unrealistic text images. Recent studies like SynthText3D [8] and UnrealText [12] address this problem by synthesizing images with 3D graphic engines. Experiment results show that text detection performance can be notably improved by using synthesized text images without text alignment error.…”
Section: Related Workmentioning
confidence: 99%
“…As a result of the rapid development of computer vision [2,18,24,28,29], methods based on the GAN model have been widely explored for text-to-image synthesis tasks. Reed et al [12] made initial attempts to conduct text-to-image synthesis with the conditional DCGAN model.…”
Section: Text-to-image Synthesismentioning
confidence: 99%