Chest L-Transformer: Local Features With Position Attention for Weakly Supervised Chest Radiograph Segmentation and Classification

Gu, Hong; Wang, Hongyu; Qin, Pan; Wang, Jia

doi:10.3389/fmed.2022.923456

Cited by 3 publications

(6 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Krishnan and colleagues [28] fine-tuned the transformer to classify X-rays images [53,54]. Gu et al designed a model called Chest L-Transformer [30] to classify chest X-ray images using the SIIM-ACR pneumothorax dataset [55]. The proposed model is composed of a backbone block based on the ResNeXt [56], a position attention block, and a classifier.…”

Section: Classificationmentioning

confidence: 99%

“…The outputs of the transformers are passed through a weighted fusion layer. [30] 2022 X-ray chest pneumothorax [55] hybrid framework [32] 2021 X-ray chest tuberculosis [57], COVID-19 [58], thorax diseases [59],…”

Section: Classificationmentioning

confidence: 99%

“…The current frame is fed into the CNN and transformer while the previous frame is fed into the CNN only. The Chest L-Transformer [30] introduced in Table 1 for image classification are also designed for segmentation.…”

Section: Segmentationmentioning

confidence: 99%

“…Due to these, the DL method has been widely applied in the field of medical image analysis to reduce inter-reader variation as well as reduce time and manpower costs. The transformerbased method has been widely used in medical image analysis either using the transformer solely [27,28,29] or hybridizing CNN and transformer to capture both local and global information [30,31,32]. However, an insightful and critical review of transformer-based medical image analysis is absent.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Recent Progress in Transformer-based Medical Image Analysis

Liu¹,

Shen²

2022

Preprint

View full text Add to dashboard Cite

The transformer has dominated the natural language processing (NLP) field for a long time. Recently, the transformer-based method has been adopted into the computer vision (CV) field and shows promising results. As an important branch of the CV field, medical image analysis joins the wave of the transformer-based method rightfully. In this review, we illustrate the principle of the attention mechanism, and the detailed structures of the transformer, and depict how the transformer is adopted into medical image analysis. We organize the transformer-based medical image analysis applications in a sequence of different tasks, including classification, segmentation, synthesis, registration, localization, detection, captioning, and denoising. For the mainstream classification and segmentation tasks, we further divided the corresponding works based on different medical imaging modalities. The datasets corresponding to the related works are also organized. We include thirteen modalities and more than twenty objects in our work.

show abstract

Section: Classificationmentioning

confidence: 99%

Section: Classificationmentioning

confidence: 99%

Section: Segmentationmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Recent Progress in Transformer-based Medical Image Analysis

Liu¹,

Shen²

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…In recent years, the appearance of the transformer (Arnab et al, 2021 ; Chen et al, 2021 ; Wang and Wang, 2022 ) has provided a new solution for vision tasks. Compared with traditional CNN (Gu et al, 2022 ) and RNN-based (Lin et al, 2022 ) methods, transformers have better capability to understand shape and geometry and capture the dependencies between long distances. We propose a spatial-temporal texture transformer network (Han et al, 2020 ).…”

Section: Related Studiesmentioning

confidence: 99%

Learning a spatial-temporal texture transformer network for video inpainting

Xue

2022

Front. Neurorobot.

View full text Add to dashboard Cite

We study video inpainting, which aims to recover realistic textures from damaged frames. Recent progress has been made by taking other frames as references so that relevant textures can be transferred to damaged frames. However, existing video inpainting approaches neglect the ability of the model to extract information and reconstruct the content, resulting in the inability to reconstruct the textures that should be transferred accurately. In this paper, we propose a novel and effective spatial-temporal texture transformer network (STTTN) for video inpainting. STTTN consists of six closely related modules optimized for video inpainting tasks: feature similarity measure for more accurate frame pre-repair, an encoder with strong information extraction ability, embedding module for finding a correlation, coarse low-frequency feature transfer, refinement high-frequency feature transfer, and decoder with accurate content reconstruction ability. Such a design encourages joint feature learning across the input and reference frames. To demonstrate the advancedness and effectiveness of the proposed model, we conduct comprehensive ablation learning and qualitative and quantitative experiments on multiple datasets by using standard stationary masks and more realistic moving object masks. The excellent experimental results demonstrate the authenticity and reliability of the STTTN.

show abstract

Recent advances of Transformers in medical image analysis: A comprehensive review

Xia

Wang

2023

MedComm – Future Medicine

View full text Add to dashboard Cite

Recent works have shown that Transformer's excellent performances on natural language processing tasks can be maintained on natural image analysis tasks. However, the complicated clinical settings in medical image analysis and varied disease properties bring new challenges for the use of Transformer. The computer vision and medical engineering communities have devoted significant effort to medical image analysis research based on Transformer with especial focus on scenario-specific architectural variations.In this paper, we comprehensively review this rapidly developing area by covering the latest advances of Transformer-based methods in medical image analysis of different settings. We first give introduction of basic mechanisms of Transformer including implementations of selfattention and typical architectures. The important research problems in various medical image data modalities, clinical visual tasks, organs and diseases are then reviewed systemically. We carefully collect 276 very recent works and 76 public medical image analysis datasets in an organized structure. Finally, discussions on open problems and future research directions are also provided. We expect this review to be an up-to-date roadmap and serve as a reference source in pursuit of boosting the development of medical image analysis field.

show abstract

Chest L-Transformer: Local Features With Position Attention for Weakly Supervised Chest Radiograph Segmentation and Classification

Cited by 3 publications

References 28 publications

Recent Progress in Transformer-based Medical Image Analysis

Recent Progress in Transformer-based Medical Image Analysis

Learning a spatial-temporal texture transformer network for video inpainting

Recent advances of Transformers in medical image analysis: A comprehensive review

Contact Info

Product

Resources

About