Detecting and Removing Text in the Wild

Cho, Junho; Yun, Sangdoo; Han, Dongil; Heo, Byeongho; Choi, Jin Young

doi:10.1109/access.2021.3110293

Cited by 9 publications

(2 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recently, learning-based techniques such as deep convolutional neural networks (CNNs) and generative adversarial networks (GANs) have been widely used for a variety of image inpainting tasks, such as eliminating objects [7,8], noises [9], texts [10], and masks [11]. Usually, the proposed CNN-based methods are classified into three categories including coarse-to-fine, coarse-andfine, and structural guidance-based methods.…”

Section: Facementioning

confidence: 99%

E2f-Net: Eyes-to-Face Inpainting Via Stylegan Latent Space

Hassanpour,

Jamalbafrani,

Yang

et al. 2023

Preprint

View full text Add to dashboard Cite

Section: Facementioning

confidence: 99%

E2f-Net: Eyes-to-Face Inpainting Via Stylegan Latent Space

Hassanpour,

Jamalbafrani,

Yang

et al. 2023

Preprint

View full text Add to dashboard Cite

“…MTRNet++ [39] shared the same spirit with EraseNet, but separately encoded the image content and text mask in two branches. Cho et al [6] proposed to jointly predict the text stroke and inpaint the background, allowing the model to focus only on the restoration of text stroke regions. Wang et al [48] presented PERT, which contained a novel progressive structure with shared parameters to remove text more thoroughly, and a region-based modification strategy to effectively guide the erasure process only on text regions.…”

Section: Related Workmentioning

confidence: 99%

Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

Liu¹,

Jin²,

Liu³

et al. 2022

Preprint

View full text Add to dashboard Cite

Text removal has attracted increasingly attention due to its various applications on privacy protection, document restoration, and text editing. It has shown significant progress with deep neural network. However, most of the existing methods often generate inconsistent results for complex background. To address this issue, we propose a Contextual-guided Text Removal Network, termed as CTRNet. CTRNet explores both low-level structure and high-level discriminative context feature as prior knowledge to guide the process of text erasure and background restoration. We further propose a Local-global Content Modeling (LGCM) block with CNNs and Transformer-Encoder to capture local features and establish the long-term relationship among pixels globally. Finally, we incorporate LGCM with context guidance for feature modeling and decoding. Experiments on benchmark datasets, SCUT-EnsText and SCUT-Syn show that CTRNet significantly outperforms the existing state-of-the-art methods. Furthermore, a qualitative experiment on examination papers also demonstrates the generalizability of our method. The code of CTRNet is available at https://github.com/lcy0604/CTRNet.

show abstract