Activation in the VTA and nucleus accumbens increases in anticipation of both gains and losses

Finetuning a pretrained backbone in the encoder part of an image transformer network has been the traditional approach for the semantic segmentation task. However, such an approach leaves out the semantic context that an image provides during the encoding stage. This paper argues that incorporating semantic information of the image into pretrained hierarchical transformer-based backbones while finetuning improves the performance considerably. To achieve this, we propose SeMask, a simple and effective framework that incorporates semantic information into the encoder with the help of a semantic attention operation. In addition, we use a lightweight semantic decoder during training to provide supervision to the intermediate semantic prior maps at every stage. Our experiments demonstrate that incorporating semantic priors enhances the performance of the established hierarchical encoders with a slight increase in the number of FLOPs. We provide empirical proof by integrating Se-Mask into each variant of the Swin-Transformer as our encoder paired with different decoders. Our framework achieves a new state-of-the-art of 58.22% mIoU on the ADE20K dataset and improvements of over 3% in the mIoU metric on the Cityscapes dataset. The code and checkpoints are publicly available at https://github.com/Picsart-AI-Research/SeMask-Segmentation.

show abstract

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Jain¹,

Zhou²,

Yu³

2022

Preprint

View full text Add to dashboard Cite

github.io/fcf-inpainting/ LaMa Ours CoModGAN Ours Image with Holes Image with Holes Geometry Structures and Object Boundary Appearance Textures and Repeating PatternsFigure 1: The most challenging issues for advanced image inpainting algorithms fall on generating better structures and textures. Left: LaMa [27] works well for repeating textures but generates fading out boundaries and structures when the holes get larger. Right: CoModGAN [43] with a StyleGAN-based [13] generator achieves impressive geometry structures, but it fails to reuse textures within the image to generate plausible repeating patterns. Our model generates good structures and textures simultaneously better than any state-of-the-arts.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jitesh Jain

SeMask: Semantically Masked Transformers for Semantic Segmentation

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Contact Info

Product

Resources

About