2020
DOI: 10.3390/app10175729
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing U-Net with Spatial-Channel Attention Gate for Abnormal Tissue Segmentation in Medical Imaging

Abstract: In recent years, deep learning has dominated medical image segmentation. Encoder-decoder architectures, such as U-Net, can be used in state-of-the-art models with powerful designs that are achieved by implementing skip connections that propagate local information from an encoder path to a decoder path to retrieve detailed spatial information lost by pooling operations. Despite their effectiveness for segmentation, these naïve skip connections still have some disadvantages. First, multi-scale skip connections t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
42
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 66 publications
(43 citation statements)
references
References 30 publications
1
42
0
Order By: Relevance
“…In addition, the t-score method performed worse on our testing dataset (Dice about 0.37) than what is described by the developers 14 (Dice about 0.5) Therefore, the classical t-score method was considered insufficiently efficient to segment lesions in our large and heterogeneous clinical dataset. The architecture of our proposed 3D DAGMNet, depicted in Figure 2, is equipped with intra skip connections as UNet3+ 24 , fused multi-scale contextual information block, deep supervision, L1-regularization on final predicts, Dual Attention Gate (DAG) [25][26][27] , self-normalized activation (SeLU) 42 , and batch normalization. The details of the important components and training techniques/parameters are outlined in the following subsections.…”
Section: Modified C-fuzzy Methods 15mentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, the t-score method performed worse on our testing dataset (Dice about 0.37) than what is described by the developers 14 (Dice about 0.5) Therefore, the classical t-score method was considered insufficiently efficient to segment lesions in our large and heterogeneous clinical dataset. The architecture of our proposed 3D DAGMNet, depicted in Figure 2, is equipped with intra skip connections as UNet3+ 24 , fused multi-scale contextual information block, deep supervision, L1-regularization on final predicts, Dual Attention Gate (DAG) [25][26][27] , self-normalized activation (SeLU) 42 , and batch normalization. The details of the important components and training techniques/parameters are outlined in the following subsections.…”
Section: Modified C-fuzzy Methods 15mentioning
confidence: 99%
“…Further developments of UNet variants, such as Mnet, DenseUnet, Unet++, and Unet3+ [22][23][24] optimized the features utilization. The emergence of attention-gate techniques [25][26][27] conditioned networks to focus on local semantical features. Recent studies applied "attention UNets", for example, to predict final ischemic lesions from baseline MRIs 28,29 .…”
Section: Introductionmentioning
confidence: 99%
“…However, over the past few years supervised learning for biomedical images segmentation has managed to achieve a human-level performance that is very promising [101]. In particular, the U-Net architecture and training strategy [103] which was originally proposed to deal with the lower number of samples commonly found in biomedical domains has been successfully applied to many segmentation problems, including abnormal tissue segmentation [104], organ segmentation in CT images [105], and tumour segmentation in brain MRI [106].…”
Section: Segmentationmentioning
confidence: 99%
“…The use of attention mechanisms and saliency masks have gained some traction in this area, as they provide a way to visualize what region of an image was attended to that led to the predicted outcome. They were employed recently in models used to screen chest X-rays of COVID-19 patients [119], predict lung module malignancy from longitudinal CT [120], perform abnormal tissue segmentation in natural, CT and MRI images [104] and quantification of knee osteoarthrisis in X-ray images [121], while the MDNet model of Zhang et al [109] used attention mechanisms to indicate which area of the image corresponded to the text in the generated diagnostic report.…”
Section: Model Interpretabilitymentioning
confidence: 99%
“…The self-attention focused on the response of positions in a sequence while CCBAM focuses on cross-channel and spatial information of feature maps. Another related work is the spatial-channel gating scheme proposed for the U-Net structure [18], which addressed medical image segmentation problem. [19] also applied a concurrent space-channel-wise attention to the redundant convolutional encoder-decoder (RCED) for speech enhancement.…”
Section: Introductionmentioning
confidence: 99%