2024
DOI: 10.1016/j.arthro.2023.07.048
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation High-Quality of Information from ChatGPT (Artificial Intelligence—Large Language Model) Artificial Intelligence on Shoulder Stabilization Surgery

Eoghan T. Hurley,
Bryan S. Crook,
Samuel G. Lorentz
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 46 publications
(21 citation statements)
references
References 22 publications
0
21
0
Order By: Relevance
“…Low-medium quality DISCERN and JAMA scores and difficult readability have been reported in studies on different subjects, similar to our study. [35,38–41] However, in our study, among the artificial intelligence chat robots, perplexity especially attracts attention with its high DISCERN and JAMA scores. The fact that the answers of this artificial intelligence chatbot contain references has caused its DISCERN and JAMA scores to be high.…”
Section: Discussionmentioning
confidence: 79%
“…Low-medium quality DISCERN and JAMA scores and difficult readability have been reported in studies on different subjects, similar to our study. [35,38–41] However, in our study, among the artificial intelligence chat robots, perplexity especially attracts attention with its high DISCERN and JAMA scores. The fact that the answers of this artificial intelligence chatbot contain references has caused its DISCERN and JAMA scores to be high.…”
Section: Discussionmentioning
confidence: 79%
“…The authors were primarily affiliated with institutions in the United States (n=47 of 122 different countries identified per publication, 38.5%), followed by Germany (n=11/122, 9%), Turkey (n=7/122, 5.7%), the United Kingdom (n=6/122, 4.9%), China/Australia/Italy (n=5/122, 4.1%, respectively), and 24 (n=36/122, 29.5%) other countries. Most studies examined one or more applications based on the GPT-3.5 architecture (n=66 of 124 different LLMs examined per study, 53.2%) 13,2629,3134,3640,4249,5254,5661,63,6567,71,72,74,75,77,78,8189,91,92,94,95,97100,102–104,106109,111 , followed by GPT-4 (n=33/124, 26.6%) 13,25,27,29,30,3436,41,43,50,51,54,55,58,61,64,6870,74,76,7981,83,87,89,90,93,96,98,99,101,105 , Bard (n=10/124, 8.1%; now known as Gemini) 33,48,49,55,73,74,80,87,94,99 , Bing Chat (n=7/124, 5.7%; now Microsoft Copilot) 49,51,55,73,94,99,110 , and other applications based on Bidirectional Encoder Representations from Transformers (BERT; n=4/124, 3...…”
Section: Resultsmentioning
confidence: 99%
“…Most studies examined one or more applications based on the GPT-3.5 architecture (n=66 of 124 different LLMs examined per study, 53.2%) 13,2629,3134,3640,4249,5254,5661,63,6567,71,72,74,75,77,78,8189,91,92,94,95,97100,102–104,106109,111 , followed by GPT-4 (n=33/124, 26.6%) 13,25,27,29,30,3436,41,43,50,51,54,55,58,61,64,6870,74,76,7981,83,87,89,90,93,96,98,99,101,105 , Bard (n=10/124, 8.1%; now known as Gemini) 33,48,49,55,73,74,80,87,94,99 , Bing Chat (n=7/124, 5.7%; now Microsoft Copilot) 49,51,55,73,94,99,110 , and other applications based on Bidirectional Encoder Representations from Transformers (BERT; n=4/124, 3.2%) 13,83,84 , Large Language Model Meta-AI (LLaMA; n=3/124, 2.4%) 55 , or Claude by Anthropic (n=1/124, 0.8%) 55 . The majority of applications were p...…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…AI‐LLMs have been utilised as a decision support tool for selecting imaging examinations and generating radiology referrals in the emergency department setting [2]. Additionally, AI‐LLMs have been demonstrated to produce highly accurate, digestible information for patients regarding various orthopaedic pathologies and procedures including anterior cruciate ligament tears and shoulder stabilisation procedures [9, 15].…”
Section: Introductionmentioning
confidence: 99%