Deep 3D Neural Network for Brain Structures Segmentation Using Self-Attention Modules in MRI Images

Laiton-Bonadiez, Camilo; Sánchez-Torres, Germán; Bedoya, John William Branch

doi:10.3390/s22072559

Cited by 13 publications

(7 citation statements)

References 85 publications


“…Following further validation using CT data from other organs, this hybrid approach can potentially have applicability in terms of detecting small and irregular lesions across different diseases and organ areas. Other studies focused on segmentation of large-scale MRI data [58][59][60][61][62].…”

Section: Segmentation (mentioning)

confidence: 99%

Is Attention all You Need in Medical Image Analysis? A Review

Papanastasiou, Dikaios, Huang et al. 2024

IEEE J. Biomed. Health Inform.


Medical imaging is a key component in clinical diagnosis, treatment planning and clinical trial design, accounting for almost 90% of all healthcare data. CNNs achieved performance gains in medical image analysis (MIA) over the last years. CNNs can efficiently model local pixel interactions and be trained on small-scale MI data. Despite their important advances, typical CNNs have relatively limited capabilities in modelling "global" pixel interactions, which restricts their generalisation ability to understand out-of-distribution data with different "global" information. The recent progress of Artificial Intelligence gave rise to Transformers, which can learn global relationships from data. However, full Transformer models need to be trained on large-scale data and involve tremendous computational complexity. Attention and Transformer compartments ("Transf/Attention"), which can well maintain properties for modelling global relationships, have been proposed as lighter alternatives to full Transformers. Recently, there is an increasing trend to co-pollinate complementary local-global properties from CNN and Transf/Attention architectures, which led to a new era of hybrid models. The past years have witnessed substantial growth in hybrid CNN-Transf/Attention models across diverse MIA problems. In this systematic review, we survey existing hybrid CNN-Transf/Attention models, review and unravel key architectural designs, analyse breakthroughs, and evaluate current and future opportunities as well as challenges. We also introduced an analysis framework on generalisation opportunities of scientific and clinical impact, based on which new data-driven domain generalisation and adaptation methods can be stimulated.


“…Self-attention is a crucial component of the transformer, enabling the representation of the degree of impact as a correlation by shifting a single sequence to different sequences, thus handling the global receptive field intrinsically [23][24][25][26]. Furthermore, instead of updating the convolution filters as typically done in a CNN [27], the self-attention mechanism updates three matrices in parallel, namely query (Q), key (K), and value (V) vectors.…”

Section: Transformer VT U-Net (mentioning)

confidence: 99%

Empowering Vision Transformer by Optimal Network Hyper-Parameter Selection for Whole Pelvis Prostate Planning Target Volume Auto-Segmentation

Cho, Lee, Kim et al. 2023

Preprint


U-Net, based on a deep convolutional neural network (CNN), has been clinically used to auto-segment normal organs and potentially target volumes. However, CNNs with local geometric dependencies may limit the accuracy of segmentation. Additionally, the performance of CNNs can vary depending on the selection of network hyper-parameters, which was mitigated by the proposition of nnU-Net. We chose a vision transformer architecture called VT U-Net, which features a self-attention excluding the convolution layer, to overcome the limitations of CNNs by utilizing global geometric information of images. The VT U-Net v.2 became more powerful thanks to the adaptive hyper-parameter optimizer embedded in nnU-Net. However, despite leveraging the benefits of nnU-Net, VT U-Net v.2 still had additional network hyper-parameters that needed to be optimally chosen. Accordingly, among various hyper-parameters, this study attempted to find the optimal combination of the patch size and the embedded dimension regarding the transformer. From the 4-fold cross-validation, the modified VT U-Net v.2 showed the highest average performance for planning target volume (PTV) segmentation among the investigated networks. Though nnU-Net was based on convolution layers, the adaptive hyper-parameter optimizers turned out to enhance the performance. It was also confirmed that network hyper-parameters affected the segmentation accuracy of vision transformers.
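The citation statement above describes self-attention in terms of query (Q), key (K), and value (V) matrices. As a rough illustration only, the following is a minimal pure-Python sketch of scaled dot-product self-attention. The function names, the toy token values, and the identity projection weights are assumptions made for this example; it is not the implementation used in VT U-Net or in any of the cited papers.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a sequence x (one row per token).

    w_q, w_k, w_v are the projection matrices that would be learned in
    training; here they are supplied by the caller (hypothetical values).
    """
    q = matmul(x, w_q)  # queries
    k = matmul(x, w_k)  # keys
    v = matmul(x, w_v)  # values
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of K
    scores = matmul(q, k_t)               # every token scored against every token
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights

# Toy input: three tokens with two features each (illustrative values only).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # assumed projection weights, for clarity
output, weights = self_attention(tokens, identity, identity, identity)
```

Each row of `weights` sums to 1 and spans all tokens in the sequence, which is one way to see the "global receptive field" the statement mentions: every output row mixes information from every input token.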


“…Also, a highly appropriate SIMD algorithm that operates on thousands of threads concurrently is executed by today's GPUs [9]. On the other hand, the most frequently used medical imaging modality for brain imaging is magnetic resonance imaging (MRI), followed by computed tomography (CT), positron emission tomography (PET), and ultrasound [10][11][12][13]. In basic terms, MRI has been widely utilized to analyze the anatomy of the entire brain [14].…”

Section: Introduction (mentioning)

confidence: 99%

GPU-Based Parallel Processing Techniques for Enhanced Brain Magnetic Resonance Imaging Analysis: A Review of Recent Advances

Kirimtat, Krejcar 2024

Sensors


The approach of using more than one processor to compute in order to overcome the complexity of different medical imaging methods that make up an overall job is known as GPU (graphic processing unit)-based parallel processing. It is extremely important for several medical imaging techniques such as image classification, object detection, image segmentation, registration, and content-based image retrieval, since the GPU-based parallel processing approach allows for time-efficient computation by software, allowing multiple computations to be completed at once. On the other hand, a non-invasive imaging technology that may depict the shape of an anatomy and the biological advancements of the human body is known as magnetic resonance imaging (MRI). Implementing GPU-based parallel processing approaches in brain MRI analysis with medical imaging techniques might be helpful in achieving immediate and timely image capture. Therefore, this extended review (the extension of the IWBBIO2023 conference paper) offers a thorough overview of the literature with an emphasis on the expanding use of GPU-based parallel processing methods for the medical analysis of brain MRIs with the imaging techniques mentioned above, given the need for quicker computation to acquire early and real-time feedback in medicine. Between 2019 and 2023, we examined the articles in the literature matrix that include the tasks, techniques, MRI sequences, and processing results. As a result, the methods discussed in this review demonstrate the advancements achieved until now in minimizing computing runtime as well as the obstacles and problems still to be solved in the future.
