2024
DOI: 10.2196/57978

The Evaluation of Generative AI Should Include Repetition to Assess Stability

Lingxuan Zhu,
Weiming Mou,
Chenglin Hong
et al.

Abstract: The increasing interest in the potential applications of generative artificial intelligence (AI) models like ChatGPT in health care has prompted numerous studies to explore its performance in various medical contexts. However, evaluating ChatGPT poses unique challenges due to the inherent randomness in its responses. Unlike traditional AI models, ChatGPT generates different responses for the same input, making it imperative to assess its stability through repetition. This commentary highlights the importance o…
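
The abstract's central point, that the same input should be submitted repeatedly to gauge how stable the responses are, can be illustrated with a minimal sketch. It is not taken from the commentary itself: the function name `stability_report`, the `ask` callable standing in for a model call, the default of five repetitions, and the choice of modal-answer agreement as the stability measure are all assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, Dict


def stability_report(ask: Callable[[str], str], prompt: str, n_repeats: int = 5) -> Dict:
    """Send the same prompt n_repeats times and summarize how often answers agree.

    `ask` is a hypothetical stand-in for whatever function queries the model;
    it takes a prompt and returns one answer as a string.
    """
    if n_repeats < 1:
        raise ValueError("n_repeats must be at least 1")

    # Collect one answer per repetition of the identical prompt.
    answers = [ask(prompt) for _ in range(n_repeats)]

    # Count identical answers and find the most frequent (modal) one.
    counts = Counter(answers)
    modal_answer, modal_count = counts.most_common(1)[0]

    return {
        "answers": answers,
        "modal_answer": modal_answer,
        # Share of repetitions that match the modal answer (1.0 = fully stable).
        "agreement_rate": modal_count / n_repeats,
        "n_distinct": len(counts),
    }
```

Exact string matching only suits short, categorical outputs such as multiple-choice letters. Free-text responses would need normalization or a similarity measure before counting, which this sketch does not attempt.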

Cited by 3 publications
References 19 publications