Endoscopic Image Classification Based on Explainable Deep Learning

Doniyorjon, Mukhtorov; Madinakhon, Rakhmonova; Muksimova, Shakhnoza; Cho, Young Im

doi:10.3390/s23063176

Cited by 32 publications

(7 citation statements)

References 56 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They demonstrate that the model is capable of accurate and robust classification and that it is able to identify the relevant regions in the input images with high accuracy. These findings have important implications for the medical field, where accurate and reliable classification of medical images is critical for effective diagnosis and treatment [ 59 , 60 ].…”

Section: Resultsmentioning

confidence: 99%

Spatial-attention ConvMixer architecture for classification and detection of gastrointestinal diseases using the Kvasir dataset

Demirbaş,

Üzen,

Fırat

2024

Health Inf Sci Syst

View full text Add to dashboard Cite

Gastrointestinal (GI) disorders, encompassing conditions like cancer and Crohn’s disease, pose a significant threat to public health. Endoscopic examinations have become crucial for diagnosing and treating these disorders efficiently. However, the subjective nature of manual evaluations by gastroenterologists can lead to potential errors in disease classification. In addition, the difficulty of diagnosing diseased tissues in GI and the high similarity between classes made the subject a difficult area. Automated classification systems that use artificial intelligence to solve these problems have gained traction. Automatic detection of diseases in medical images greatly benefits in the diagnosis of diseases and reduces the time of disease detection. In this study, we suggested a new architecture to enable research on computer-assisted diagnosis and automated disease detection in GI diseases. This architecture, called Spatial-Attention ConvMixer (SAC), further developed the patch extraction technique used as the basis of the ConvMixer architecture with a spatial attention mechanism (SAM). The SAM enables the network to concentrate selectively on the most informative areas, assigning importance to each spatial location within the feature maps. We employ the Kvasir dataset to assess the accuracy of classifying GI illnesses using the SAC architecture. We compare our architecture’s results with Vanilla ViT, Swin Transformer, ConvMixer, MLPMixer, ResNet50, and SqueezeNet models. Our SAC method gets 93.37% accuracy, while the other architectures get respectively 79.52%, 74.52%, 92.48%, 63.04%, 87.44%, and 85.59%. The proposed spatial attention block improves the accuracy of the ConvMixer architecture on the Kvasir, outperforming the state-of-the-art methods with an accuracy rate of 93.37%.

show abstract

Section: Resultsmentioning

confidence: 99%

Spatial-attention ConvMixer architecture for classification and detection of gastrointestinal diseases using the Kvasir dataset

Demirbaş,

Üzen,

Fırat

2024

Health Inf Sci Syst

View full text Add to dashboard Cite

show abstract

“…An effective augmentation technique was employed to classify medical images using the heat map of classification results, which had an accuracy of 98.2% during training and 93.46% during validation ( Mukhtorov et al, 2023 ). The previous results on Gastrointestinal tracts demonstrate that the proposed model is outclassed in terms of all performance metrics; it achieved 99.22% accuracy on dataset 1 (eight classes) and 96.63% on dataset 2 (four classes).…”

Section: Resultsmentioning

confidence: 99%

“…They made use of the 8,000 wireless capsule images that were available for viewing in the freely available Kvasir database ( Khan et al, 2022 ). A high-performing outcome for the classification of medical images was achieved by using an efficient augmentation method in conjunction with the classification results, which had an accuracy of 98.28% during training and 93.46% during validation ( Mukhtorov et al, 2023 ). In a separate piece of research, the researchers explain the methodologies and processes for applying deep learning algorithms to examine a wide variety of gastrointestinal disorders and recognize these images.…”

Section: Related Workmentioning

confidence: 99%

Efficient-gastro: optimized EfficientNet model for the detection of gastrointestinal disorders using transfer learning and wireless capsule endoscopy images

Al-Otaibi,

Rehman,

Mujahid

et al. 2024

PeerJ Computer Science

View full text Add to dashboard Cite

Gastrointestinal diseases cause around two million deaths globally. Wireless capsule endoscopy is a recent advancement in medical imaging, but manual diagnosis is challenging due to the large number of images generated. This has led to research into computer-assisted methodologies for diagnosing these images. Endoscopy produces thousands of frames for each patient, making manual examination difficult, laborious, and error-prone. An automated approach is essential to speed up the diagnosis process, reduce costs, and potentially save lives. This study proposes transfer learning-based efficient deep learning methods for detecting gastrointestinal disorders from multiple modalities, aiming to detect gastrointestinal diseases with superior accuracy and reduce the efforts and costs of medical experts. The Kvasir eight-class dataset was used for the experiment, where endoscopic images were preprocessed and enriched with augmentation techniques. An EfficientNet model was optimized via transfer learning and fine tuning, and the model was compared to the most widely used pre-trained deep learning models. The model’s efficacy was tested on another independent endoscopic dataset to prove its robustness and reliability.

show abstract

“…Convolutional neural networks (CNNs) have revolutionized image recognition tasks due to their ability to learn complex hierarchical features from images. In medical imaging, CNN architectures like EfficientNet [5], VGG-16 [6], ResNet [6], and GoogleNet [7] have been particularly effective. For example, EfficientNet has been utilized for its efficiency and scalability in processing high-resolution medical images, while ResNet's deep residual learning framework helps in learning from enormous datasets commonly used in medical diagnostics.…”

Section: Recent Advances In Medical Imagingmentioning

confidence: 99%

Integrating Principal Component Analysis and Multi-Input Convolutional Neural Networks for Advanced Skin Lesion Cancer Classification

Madinakhon,

Mukhtorov,

Cho

2024

Applied Sciences

Self Cite

View full text Add to dashboard Cite

The importance of early detection in the management of skin lesions, such as skin cancer, cannot be overstated due to its critical role in enhancing treatment outcomes. This study presents an innovative multi-input model that fuses image and tabular data to improve the accuracy of diagnoses. The model incorporates a dual-input architecture, combining a ResNet-152 for image processing with a multilayer perceptron (MLP) for tabular data analysis. To optimize the handling of tabular data, Principal Component Analysis (PCA) is employed to reduce dimensionality, facilitating more focused and efficient model training. The model’s effectiveness is confirmed through rigorous testing, yielding impressive metrics with an F1 score of 98.91%, a recall of 99.19%, and a precision of 98.76%. These results underscore the potential of combining multiple data inputs to provide a nuanced analysis that outperforms single-modality approaches in skin lesion diagnostics.

show abstract

Endoscopic Image Classification Based on Explainable Deep Learning

Cited by 32 publications

References 56 publications

Spatial-attention ConvMixer architecture for classification and detection of gastrointestinal diseases using the Kvasir dataset

Spatial-attention ConvMixer architecture for classification and detection of gastrointestinal diseases using the Kvasir dataset

Efficient-gastro: optimized EfficientNet model for the detection of gastrointestinal disorders using transfer learning and wireless capsule endoscopy images

Integrating Principal Component Analysis and Multi-Input Convolutional Neural Networks for Advanced Skin Lesion Cancer Classification

Contact Info

Product

Resources

About