2024
DOI: 10.1038/s41598-024-52703-2
|View full text |Cite
|
Sign up to set email alerts
|

Efficient pneumonia detection using Vision Transformers on chest X-rays

Sukhendra Singh,
Manoj Kumar,
Abhay Kumar
et al.

Abstract: Pneumonia is a widespread and acute respiratory infection that impacts people of all ages. Early detection and treatment of pneumonia are essential for avoiding complications and enhancing clinical results. We can reduce mortality, improve healthcare efficiency, and contribute to the global battle against a disease that has plagued humanity for centuries by devising and deploying effective detection methods. Detecting pneumonia is not only a medical necessity but also a humanitarian imperative and a technologi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(2 citation statements)
references
References 56 publications
(18 reference statements)
0
2
0
Order By: Relevance
“…Singh et al fine-tuned the DEIT_B model pretrained on the ImageNet dataset. We replicated the results using the same official data split as ours, yielding an AUC of 0.995 and an accuracy of 94.50% 31 . Nisho et al fine-tuned the EfficientNet with the noisy student network pretrained on the ImageNet dataset.…”
Section: Discussionmentioning
confidence: 91%
“…Singh et al fine-tuned the DEIT_B model pretrained on the ImageNet dataset. We replicated the results using the same official data split as ours, yielding an AUC of 0.995 and an accuracy of 94.50% 31 . Nisho et al fine-tuned the EfficientNet with the noisy student network pretrained on the ImageNet dataset.…”
Section: Discussionmentioning
confidence: 91%
“…However, its application in medical image processing is still modest compared with that in other fields. Several medical imaging research studies have utilized the ViT structure in various domains, including esophageal endoscopic detection, pneumonia detection, MRI imaging, skin cancer detection, and tumor classification [ 44 , 45 , 46 , 47 ]. It is evident that image subjects amenable to ViT application are typically characterized by high spatial information correlation in sequence form, making them well suited for the incorporation of transformer modules such as the self-attention mechanism and position encoding, as shown in Figure 4 .…”
Section: Methodsmentioning
confidence: 99%