Deep Learning Techniques for Ear Diseases Based on Segmentation of the Normal Tympanic Membrane

Park, Yongsoon; Jeon, Jun Ho; Kong, Tae Hoon; Chung, Tae Yun; Seo, Young Joon

doi:10.21053/ceo.2022.00675

Cited by 7 publications

(3 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The emergence of artificial intelligence (AI) has altered the landscape of medical technology, particularly in diagnosis, which leverages the identification of features based on imaging and physiological data [1][2][3]. In the field of otolaryngology, AI and deep learning models are being used for imaging; ongoing efforts focus on classifying diseases based on tympanic membrane images of middle ear disease [4][5][6]. Technological advancements, including deep learning and transfer learning using pre-trained models, have resulted in an accuracy range of 70-90% in models for analyzing otoscopic images [7].…”

Section: Introductionmentioning

confidence: 99%

Feasibility of Multimodal Artificial Intelligence Using GPT-4 Vision for the Classification of Middle Ear Disease: Qualitative Study and Validation

Noda,

Yoshimura,

Okubo

et al. 2024

JMIR AI

View full text Add to dashboard Cite

Background The integration of artificial intelligence (AI), particularly deep learning models, has transformed the landscape of medical technology, especially in the field of diagnosis using imaging and physiological data. In otolaryngology, AI has shown promise in image classification for middle ear diseases. However, existing models often lack patient-specific data and clinical context, limiting their universal applicability. The emergence of GPT-4 Vision (GPT-4V) has enabled a multimodal diagnostic approach, integrating language processing with image analysis. Objective In this study, we investigated the effectiveness of GPT-4V in diagnosing middle ear diseases by integrating patient-specific data with otoscopic images of the tympanic membrane. Methods The design of this study was divided into two phases: (1) establishing a model with appropriate prompts and (2) validating the ability of the optimal prompt model to classify images. In total, 305 otoscopic images of 4 middle ear diseases (acute otitis media, middle ear cholesteatoma, chronic otitis media, and otitis media with effusion) were obtained from patients who visited Shinshu University or Jichi Medical University between April 2010 and December 2023. The optimized GPT-4V settings were established using prompts and patients’ data, and the model created with the optimal prompt was used to verify the diagnostic accuracy of GPT-4V on 190 images. To compare the diagnostic accuracy of GPT-4V with that of physicians, 30 clinicians completed a web-based questionnaire consisting of 190 images. Results The multimodal AI approach achieved an accuracy of 82.1%, which is superior to that of certified pediatricians at 70.6%, but trailing behind that of otolaryngologists at more than 95%. The model’s disease-specific accuracy rates were 89.2% for acute otitis media, 76.5% for chronic otitis media, 79.3% for middle ear cholesteatoma, and 85.7% for otitis media with effusion, which highlights the need for disease-specific optimization. Comparisons with physicians revealed promising results, suggesting the potential of GPT-4V to augment clinical decision-making. Conclusions Despite its advantages, challenges such as data privacy and ethical considerations must be addressed. Overall, this study underscores the potential of multimodal AI for enhancing diagnostic accuracy and improving patient care in otolaryngology. Further research is warranted to optimize and validate this approach in diverse clinical settings.

show abstract

Section: Introductionmentioning

confidence: 99%

Feasibility of Multimodal Artificial Intelligence Using GPT-4 Vision for the Classification of Middle Ear Disease: Qualitative Study and Validation

Noda,

Yoshimura,

Okubo

et al. 2024

JMIR AI

View full text Add to dashboard Cite

show abstract

“…Due to bacterial infection, AOM may occur in the middle of the ear, bringing about the build-up of fluid. OME leads to fluid build-up in the middle of the ear because of inflammation, which is much more severe than AOM [3]. The standard diagnosis depends on pneumatic otoscopy, the benchmark for distinguishing AOM from OME [4].…”

Section: Introductionmentioning

confidence: 99%

Artificial Intelligence With Deep Learning Based Automated Ear Infection Detection

Mehedi,

Hanif,

Bilal

et al. 2024

IEEE Access

View full text Add to dashboard Cite

Artificial intelligence (AI) related to intelligent control in healthcare denotes using AI techniques to enhance the management and control of healthcare processes and systems. Damage to the inner and middle ear caused by accidents and diseases even causes hearing impairment in the ear that has been harmed or injured. Traditional otoscopy devices were utilized to check the tympanic membrane (TM) to identify OM in medical practice, and a conclusion is drawn depending on the outcomes of the examination. While developing a computer-aided method to support the OM diagnosis, it is possible to focus on methods like feature extraction, image pre-processing, classification, and image segmentation. The existing methodology of detecting the ear infection experiences a reduction of accuracy due to the influence of the noise in the input ear image. This presence of noise affects the feature extraction process, directly influences the accuracy in detection process. To overcome this issue, in this manuscript, a Deep learning (DL) is utilized to find biomedical ear infections by examining images of the eardrum and ear canal. The process includes training a DL method with a large dataset of ear images, where the images were labeled as either not infected or infected. With this motivation, this article emphasizes the design of Bayesian optimization with a deep learning-based automated ear infection detection and classification (BODL-AEIDC) model. The BODL-AEIDC technique exploits the DL model with a metaheuristic optimization algorithm for the ear infection classification process. The BODL-AEIDC technique employs a Wiener filtering (WF) based noise removal process to eliminate the noise data. In addition, the BODL-AEIDC technique exploits W-Net-based segmentation and the EfficientNet model for feature extraction purposes. Moreover, the BODL-AEIDC technique employs a fuzzy Restricted Boltzmann machine (FRBM) model for ear infection detection. Furthermore, the BO algorithm is utilized to adjust the FRBM technique's hyperparameter values effectively. The BODL-AEIDC technique's experimental outcomes occur using the medical dataset. The comprehensive comparative study stated the enhanced performance of the BODL-AEIDC approach over other existing methods.

show abstract

“…AOM, or middle ear mass, can result from bacterial infection and cause fluid accumulation. OME causes a buildup of fluid in the middle of the ear due to inflammation, which is significantly worse than AOM [3]. Pneumatic otoscopy, the gold standard for differentiating between AOM and OME, is required for the conventional diagnosis [4].…”

Section: Introductionmentioning

confidence: 99%

Artificial Intelligence With Deep Learning Based Automated Ear Infection Detection

2024

IRJMETS

View full text Add to dashboard Cite

Deep Learning Techniques for Ear Diseases Based on Segmentation of the Normal Tympanic Membrane

Cited by 7 publications

References 23 publications

Feasibility of Multimodal Artificial Intelligence Using GPT-4 Vision for the Classification of Middle Ear Disease: Qualitative Study and Validation

Feasibility of Multimodal Artificial Intelligence Using GPT-4 Vision for the Classification of Middle Ear Disease: Qualitative Study and Validation

Artificial Intelligence With Deep Learning Based Automated Ear Infection Detection

Artificial Intelligence With Deep Learning Based Automated Ear Infection Detection

Contact Info

Product

Resources

About