CoT-UNet++: A medical image segmentation method based on contextual transformer and dense connection

Yin, Yijun; Xu, W. Z.; Chen, Lei; Wu, Hao

doi:10.3934/mbe.2023364

Cited by 9 publications

(4 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Yin et al 23 proposed CoT-UNet++, an improved network based on the TransUNet 76 network architecture, which consists of a hybrid encoder, dense connections, and a decoder. CoT-UNet++ uses a hybrid encoder to obtain neighboring context information for CoTNet 77 coding and global context for Transformer coding.…”

Section: D Tooth Segmentation Methods Based On Transformersmentioning

confidence: 99%

“…99 Transformer was initially the top network model in natural language processing (NLP), and has also shown excellent performance when it applied to image tasks. Several works 23,24 have used the Transformer attention mechanism to achieve better tooth segmentation.…”

Section: Overview Of Deep Learning Methods For Tooth Segmentationmentioning

confidence: 99%

“…Transformer takes full advantage of the multi-head attention mechanism, which achieves a performance improvement over many previous deep learning models based on CNNs. Some works [22][23][24] realized tooth segmentation with transformer as the backbone module. Xiong et al 24 achieved a 96% dice score on a private dataset containing 16,000 3D dental models.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Deep learning-based tooth segmentation methods in medical imaging: A review

Chen,

Ma,

et al. 2024

Proc Inst Mech Eng H

View full text Add to dashboard Cite

Deep learning approaches for tooth segmentation employ convolutional neural networks (CNNs) or Transformers to derive tooth feature maps from extensive training datasets. Tooth segmentation serves as a critical prerequisite for clinical dental analysis and surgical procedures, enabling dentists to comprehensively assess oral conditions and subsequently diagnose pathologies. Over the past decade, deep learning has experienced significant advancements, with researchers introducing efficient models such as U-Net, Mask R-CNN, and Segmentation Transformer (SETR). Building upon these frameworks, scholars have proposed numerous enhancement and optimization modules to attain superior tooth segmentation performance. This paper discusses the deep learning methods of tooth segmentation on dental panoramic radiographs (DPRs), cone-beam computed tomography (CBCT) images, intro oral scan (IOS) models, and others. Finally, we outline performance-enhancing techniques and suggest potential avenues for ongoing research. Numerous challenges remain, including data annotation and model generalization limitations. This paper offers insights for future tooth segmentation studies, potentially facilitating broader clinical adoption.

show abstract

Section: D Tooth Segmentation Methods Based On Transformersmentioning

confidence: 99%

Section: Overview Of Deep Learning Methods For Tooth Segmentationmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Deep learning-based tooth segmentation methods in medical imaging: A review

Chen,

Ma,

et al. 2024

Proc Inst Mech Eng H

View full text Add to dashboard Cite

show abstract

“…Wang et al [8] argued that TransUNet, similar to the UNet architectures, has skip connections at the same level, which constrains feature fusion. Instead, new model architectures such as MS-TransUNet++ [8] and CoT-UNet++ have been proposed [47]. These models use dense skip connections between the encoder and decoder at different levels to improve feature fusion, similar to UNet++, in addition to a hybrid encoder that includes a Transformer.…”

Section: Introductionmentioning

confidence: 99%

Improved UNet with Attention for Medical Image Segmentation

AL Qurri,

Almekkawy

2023

Sensors

View full text Add to dashboard Cite

Medical image segmentation is crucial for medical image processing and the development of computer-aided diagnostics. In recent years, deep Convolutional Neural Networks (CNNs) have been widely adopted for medical image segmentation and have achieved significant success. UNet, which is based on CNNs, is the mainstream method used for medical image segmentation. However, its performance suffers owing to its inability to capture long-range dependencies. Transformers were initially designed for Natural Language Processing (NLP), and sequence-to-sequence applications have demonstrated the ability to capture long-range dependencies. However, their abilities to acquire local information are limited. Hybrid architectures of CNNs and Transformer, such as TransUNet, have been proposed to benefit from Transformer’s long-range dependencies and CNNs’ low-level details. Nevertheless, automatic medical image segmentation remains a challenging task due to factors such as blurred boundaries, the low-contrast tissue environment, and in the context of ultrasound, issues like speckle noise and attenuation. In this paper, we propose a new model that combines the strengths of both CNNs and Transformer, with network architectural improvements designed to enrich the feature representation captured by the skip connections and the decoder. To this end, we devised a new attention module called Three-Level Attention (TLA). This module is composed of an Attention Gate (AG), channel attention, and spatial normalization mechanism. The AG preserves structural information, whereas channel attention helps to model the interdependencies between channels. Spatial normalization employs the spatial coefficient of the Transformer to improve spatial attention akin to TransNorm. To further improve the skip connection and reduce the semantic gap, skip connections between the encoder and decoder were redesigned in a manner similar to that of the UNet++ dense connection. Moreover, deep supervision using a side-output channel was introduced, analogous to BASNet, which was originally used for saliency predictions. Two datasets from different modalities, a CT scan dataset and an ultrasound dataset, were used to evaluate the proposed UNet architecture. The experimental results showed that our model consistently improved the prediction performance of the UNet across different datasets.

show abstract

Computed tomography image segmentation of irregular cerebral hemorrhage lesions based on improved U-Net

Yu-long

Zeng

et al. 2023

Journal of Radiation Research and Applied Sciences

View full text Add to dashboard Cite

CoT-UNet++: A medical image segmentation method based on contextual transformer and dense connection

Cited by 9 publications

References 25 publications

Deep learning-based tooth segmentation methods in medical imaging: A review

Deep learning-based tooth segmentation methods in medical imaging: A review

Improved UNet with Attention for Medical Image Segmentation

Computed tomography image segmentation of irregular cerebral hemorrhage lesions based on improved U-Net

Contact Info

Product

Resources

About