Enhancing Mask Transformer with Auxiliary Convolution Layers for Semantic Segmentation

Xia, Zhengyu; Kim, Joohee

doi:10.3390/s23020581

Sensors

2023

DOI: 10.3390/s23020581

|View full text |Cite

Enhancing Mask Transformer with Auxiliary Convolution Layers for Semantic Segmentation

Zhengyu Xia

Joohee Kim

Abstract: Transformer-based semantic segmentation methods have achieved excellent performance in recent years. Mask2Former is one of the well-known transformer-based methods which unifies common image segmentation into a universal model. However, it performs relatively poorly in obtaining local features and segmenting small objects due to relying heavily on transformers. To this end, we propose a simple yet effective architecture that introduces auxiliary branches to Mask2Former during training to capture dense local fe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2023

2024

Publication Types

Select...

Article4

Relationship

Self Cite0

Independent4

Authors

Journals

Cited by 4 publications

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Enhancing Mask Transformer with Auxiliary Convolution Layers for Semantic Segmentation

Cited by 4 publications

References 39 publications

Parotid Gland Segmentation Using Purely Transformer-Based U-Shaped Network and Multimodal MRI

Parotid Gland Segmentation Using Purely Transformer-Based U-Shaped Network and Multimodal MRI

A survey of the vision transformers and their CNN-transformer based variants

Application of deep-learning to the automatic segmentation and classification of lateral lymph nodes on ultrasound images of papillary thyroid carcinoma

Contact Info

Product

Resources

About