2021
DOI: 10.48550/arxiv.2103.10504
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

UNETR: Transformers for 3D Medical Image Segmentation

Abstract: Fully Convolutional Neural Networks (FCNNs) with contracting and expansive paths (e.g. encoder and decoder) have shown prominence in various medical image segmentation applications during the recent years. In these architectures, the encoder plays an integral role by learning global contextual representations which will be further utilized for semantic output prediction by the decoder. Despite their success, the locality of convolutional layers , as the main building block of FCNNs limits the capability of lea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
90
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 65 publications
(90 citation statements)
references
References 44 publications
0
90
0
Order By: Relevance
“…Transformer-based models have recently gained a lot of attraction in computer vision [14,42,25] and medical image analysis [11,16]. Chen et al [11] introduced a 2D U-Net architecture that benefits from a ViT in the bottleneck of the network.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Transformer-based models have recently gained a lot of attraction in computer vision [14,42,25] and medical image analysis [11,16]. Chen et al [11] introduced a 2D U-Net architecture that benefits from a ViT in the bottleneck of the network.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, ViTs have achieved success in effective learning of pretext tasks for self-supervised pre-training in various applications [9,8,36]. In medical image analysis, UNETR [16] is the first methodology that utilizes a ViT as its encoder without relying on a CNN-based feature extractor. Other approaches [40,39] have attempted to leverage the power of ViTs as a stand-alone block in their architectures which otherwise consist of CNN-based components.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Transformers in Medical Image Segmentation. A number of recent studies Cao et al, 2021;Xie et al, 2021b;Hatamizadeh et al, 2021;Valanarasu et al, 2021;Isensee et al, 2021;Zheng et al, 2020) (Cao et al, 2021) explored how to use a pure transformer for medical image analysis tasks. However, the results do not lead to better performance.…”
Section: Related Workmentioning
confidence: 99%
“…Transformers are particularly suited for this unique application, since aneurysms can be extremely small relative to a patient's overall brain mass, and they need to be carefully identified by type and characterized in fine detail to determine the appropriate medical or surgical intervention. Although a few studies have been published that use Transformer Neural Network models to perform tasks such as brain tumor segmentation, retinal blood vessel segmentation, and polyp segmentation in colonoscopy images [19][20][21][22][23][24][25][26], no studies to our knowledge have employed a Transformer architecture to detect aneurysms. This study proposes the use of a novel Transformer Deep Learning Neural Network approach to identify intracranial aneurysms in 3D MRA scans.…”
Section: Introductionmentioning
confidence: 99%