2022
DOI: 10.1002/int.22956
|View full text |Cite
|
Sign up to set email alerts
|

RT‐Unet: An advanced network based on residual network and transformer for medical image segmentation

Abstract: For the past several years, semantic segmentation method based on deep learning, especially Unet, have achieved tremendous success in medical image processing. The U-shaped topology of Unet can well solve image segmentation tasks. However, due to the limitation of traditional convolution operations, Unet cannot realize global semantic information interaction. To address this problem, this paper proposes RT-Unet, which combines the advantages of Transformer and Residual network for accurate medical segmentation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 24 publications
0
5
0
Order By: Relevance
“…The redundancy and long-distance interdependence between pixels in network learning are successfully resolved by UniFormer [28], which uses CNN and transformer to extract global and local information. Conformer [36] and RFTNet [37] improve the feature representation capacity of network learning by fusing the capabilities of transformer to create global associations with the capabilities of CNN to extract local features. The ability of network learning to represent features.…”
Section: Convolution and Transformermentioning
confidence: 99%
“…The redundancy and long-distance interdependence between pixels in network learning are successfully resolved by UniFormer [28], which uses CNN and transformer to extract global and local information. Conformer [36] and RFTNet [37] improve the feature representation capacity of network learning by fusing the capabilities of transformer to create global associations with the capabilities of CNN to extract local features. The ability of network learning to represent features.…”
Section: Convolution and Transformermentioning
confidence: 99%
“…In addition, one of the main object detection paradigms is the feature pyramid network [1,26,33,34], which fuses image features of diferent scales through the top-down and bottom-up paths. Tis way results in the bottom feature that can also share the rich semantic information with the top feature [35,36]. For instance, Bi-FPN [37] enhances the representation ability of features by adding residual connections to the original structure and removing nodes without feature fusion to reduce the amount of computation.…”
Section: Muti-scale Featuresmentioning
confidence: 99%
“…In the feld of medical image segmentation based on deep learning [8][9][10], the encoderdecoder structure is one of the most commonly used network structures [11][12][13]. UNet [7] and SegNet [14] are two representatives of the encoder-decoder structure-based methods.…”
Section: Encoder-decoder Structurementioning
confidence: 99%