2024
DOI: 10.1186/s12911-024-02757-z

Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Cindy N. Ho, Tiffany Tian, Alessandra T. Ayers, et al.

Abstract: Background: The large language models (LLMs), most notably ChatGPT, released since November 30, 2022, have prompted shifting attention to their use in medicine, particularly for supporting clinical decision-making. However, there is little consensus in the medical community on how LLM performance in clinical contexts should be evaluated. Methods: We performed a literature review of PubMed to identify publications between December 1, 2022, and April 1, 2024, that discussed…

References 122 publications