2023
DOI: 10.21037/qims-22-544
|View full text |Cite
|
Sign up to set email alerts
|

UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network

Abstract: Background: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution.Methods: We… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…Similarly, MRFormer optimises capturing the target and its surrounding context by embedding multi-head attention mechanisms and residual depth-wise convolutional networks within the skip connections. Fang et al demonstrate that MR-Former outperforms the basic U-Net model in pancreatic CT image segmentation [47]. Additionally, MDAG-Net incorporates a novel multi-dimensional attention module (MDAG) to bolster skip connections, capturing context information and precisely defining target features more effectively [48].…”
Section: Skip Connection Variationsmentioning
confidence: 99%
“…Similarly, MRFormer optimises capturing the target and its surrounding context by embedding multi-head attention mechanisms and residual depth-wise convolutional networks within the skip connections. Fang et al demonstrate that MR-Former outperforms the basic U-Net model in pancreatic CT image segmentation [47]. Additionally, MDAG-Net incorporates a novel multi-dimensional attention module (MDAG) to bolster skip connections, capturing context information and precisely defining target features more effectively [48].…”
Section: Skip Connection Variationsmentioning
confidence: 99%
“…In response to the above problems, various methods based on deep learning [7][8][9][10] have become the mainstream in the field of medical image segmentation due to their powerful feature learning, endto-end training, adaptability and generalization. At present, many researchers focus on fully convolutional networks (FCN) [11] and U-shaped structures [12][13][14][15], which usually employ encoder-decoder frameworks and thus perform well in simulating local features of images. However, due to the limited receptive field of convolution operations, these methods are often difficult to capture the global dependence of features, which is particularly important in semantic segmentation.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, in order to retain the features of small targets in the deep network, previous studies have also adopted the squeezing and excitation module for various image-processing tasks, which has improved the segmentation effect compared with that of the state-of-the art model ( 18 , 19 ). In addition, researchers have further extended the capabilities of the original U-Net by combining U-Net with Transformer in order to fully utilize the low-level features, which in turn can enhance the global features and reduce the semantic gap between the encoding and decoding stages ( 20 ). However, these methods suffer from the problems of excessive parameter amount, with the large number of samples required for training the model hindering convergence, which limits the scope of potential applications in radiotherapy image segmentation.…”
Section: Introductionmentioning
confidence: 99%