The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Warrier, Akshay; Singh, Rohan; Haleem, Afash; Zaki, Haider; Eloy, Jean Anderson

doi:10.1002/lary.31434

The Laryngoscope

2024

DOI: 10.1002/lary.31434

|View full text |Cite

The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Akshay Warrier,

Rohan Singh,

Afash Haleem

et al.

Abstract: ObjectivesEvaluate and compare the ability of large language models (LLMs) to diagnose various ailments in otolaryngology.MethodsWe collected all 100 clinical vignettes from the second edition of Otolaryngology Cases—The University of Cincinnati Clinical Portfolio by Pensak et al. With the addition of the prompt “Provide a diagnosis given the following history,” we prompted ChatGPT‐3.5, Google Bard, and Bing‐GPT4 to provide a diagnosis for each vignette. These diagnoses were compared to the portfolio for accur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article5

Relationship

Self Cite0

Independent5

Authors

Journals

Cited by 6 publications

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

In Reference to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Maniaci,

Lentini,

Boscolo‐Rizzo

et al. 2024

The Laryngoscope

View full text Add to dashboard Cite

In Reference to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Maniaci,

Lentini,

Boscolo‐Rizzo

et al. 2024

The Laryngoscope

View full text Add to dashboard Cite

In Response to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Warrier,

Singh,

Haleem

et al. 2024

The Laryngoscope

View full text Add to dashboard Cite

Chasing sleep physicians: ChatGPT-4o on the interpretation of polysomnographic results

Seifen,

Huppertz,

Gouveris

et al. 2024

Eur Arch Otorhinolaryngol

View full text Add to dashboard Cite

Background From a healthcare professional's perspective, the use of ChatGPT (Open AI), a large language model (LLM), offers huge potential as a practical and economic digital assistant. However, ChatGPT has not yet been evaluated for the interpretation of polysomnographic results in patients with suspected obstructive sleep apnea (OSA). Aims/objectives To evaluate the agreement of polysomnographic result interpretation between ChatGPT-4o and a board-certified sleep physician and to shed light into the role of ChatGPT-4o in the field of medical decision-making in sleep medicine. Material and methods For this proof-of-concept study, 40 comprehensive patient profiles were designed, which represent a broad and typical spectrum of cases, ensuring a balanced distribution of demographics and clinical characteristics. After various prompts were tested, one prompt was used for initial diagnosis of OSA and a further for patients with positive airway pressure (PAP) therapy intolerance. Each polysomnographic result was independently evaluated by ChatGPT-4o and a board-certified sleep physician. Diagnosis and therapy suggestions were analyzed for agreement. Results ChatGPT-4o and the sleep physician showed 97% (29/30) concordance in the diagnosis of the simple cases. For the same cases the two assessment instances unveiled 100% (30/30) concordance regarding therapy suggestions. For cases with intolerance of treatment with positive airway pressure (PAP) ChatGPT-4o and the sleep physician revealed 70% (7/10) concordance in the diagnosis and 44% (22/50) concordance for therapy suggestions. Conclusion and significance Precise prompting improves the output of ChatGPT-4o and provides sleep physician-like polysomnographic result interpretation. Although ChatGPT shows some shortcomings in offering treatment advice, our results provide evidence for AI assisted automation and economization of polysomnographic interpretation by LLMs. Further research should explore data protection issues and demonstrate reproducibility with real patient data on a larger scale.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Cited by 6 publications

References 33 publications

In Reference to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

In Reference to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

In Response to The Comparative Diagnostic Capability of Large Language Models in Otolaryngology

Chasing sleep physicians: ChatGPT-4o on the interpretation of polysomnographic results

Contact Info

Product

Resources

About