Proceedings of the 30th ACM International Conference on Multimedia 2022
DOI: 10.1145/3503161.3548348
|View full text |Cite
|
Sign up to set email alerts
|

Atrous Pyramid Transformer with Spectral Convolution for Image Inpainting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…The first transformer stacks self-attention layers and outperforms the best result of that time (Vaswani et al 2017). The idea of transformers inspires researchers to develop more transformer architectures (Huang and Zhang 2022). For example, a newly proposed model, DeiT III (Touvron, Cord, and Jégou 2022), is a variant of ViT that incorporates a new data enhancement procedure that includes Gaussian blur, solarization, and grayscale, and it achieves a competitive performance in image classification.…”
Section: Related Work Deep Learning Based Methodsmentioning
confidence: 99%
“…The first transformer stacks self-attention layers and outperforms the best result of that time (Vaswani et al 2017). The idea of transformers inspires researchers to develop more transformer architectures (Huang and Zhang 2022). For example, a newly proposed model, DeiT III (Touvron, Cord, and Jégou 2022), is a variant of ViT that incorporates a new data enhancement procedure that includes Gaussian blur, solarization, and grayscale, and it achieves a competitive performance in image classification.…”
Section: Related Work Deep Learning Based Methodsmentioning
confidence: 99%
“…Ren et al [23] used smoothed images without edges to train a structure reconstructor, which generated the structures of the missing areas and then a texture generator employed the reconstructed structures with an appearance flow to generate the final restored images. Huang et al [24] designed a two-stage approach based on a novel atrous pyramid transformer (APT) for image inpainting. The inpainting method first uses several layers of APT blocks to restore the semantic structures of images and then a dual spectral transform convolutional (DSTC) module is applied to work together with the APT to infer the textural details of damaged areas.…”
Section: Multistage Image Inpaintingmentioning
confidence: 99%