2024
DOI: 10.1136/jcp-2023-209304
|View full text |Cite
|
Sign up to set email alerts
|

Accuracy of GPT-4 in histopathological image detection and classification of colorectal adenomas

Thiyaphat Laohawetwanit,
Chutimon Namboonlue,
Sompon Apornvirat

Abstract: AimsTo evaluate the accuracy of Chat Generative Pre-trained Transformer (ChatGPT) powered by GPT-4 in histopathological image detection and classification of colorectal adenomas using the diagnostic consensus provided by pathologists as a reference standard.MethodsA study was conducted with 100 colorectal polyp photomicrographs, comprising an equal number of adenomas and non-adenomas, classified by two pathologists. These images were analysed by classic GPT-4 for 1 time in October 2023 and custom GPT-4 for 20 … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…[19] Although integrating image-based questions could provide a more comprehensive assessment of LLMs in pathology, recent literature indicates that the accuracy of medical image analysis by these models remains suboptimal. [20][21][22] Thus, incorporating image-based questions might not yet yield reliable comparisons at this point and could require further technological advancements and validations. Second, the relatively small number of trainee participants is a notable limitation.…”
Section: Discussionmentioning
confidence: 99%
“…[19] Although integrating image-based questions could provide a more comprehensive assessment of LLMs in pathology, recent literature indicates that the accuracy of medical image analysis by these models remains suboptimal. [20][21][22] Thus, incorporating image-based questions might not yet yield reliable comparisons at this point and could require further technological advancements and validations. Second, the relatively small number of trainee participants is a notable limitation.…”
Section: Discussionmentioning
confidence: 99%
“…Although such a conversation is currently possible with standard GPT-4, GPT-4's knowledge on this topic will be limited and the surgeon will not be able to obtain detailed information. Unfortunately, the image analysis function of GPTs is currently not sufficient we underscored the possibilities through text only data, we believe that with the improvement of image recognition function, many new areas of application will arise [21].…”
Section: Clinical and Practical Relevancementioning
confidence: 97%
“…We found two studies investigating OpenAI's system [20,21]. In the first study, an anesthesia team uploaded their department protocols to an cGPT, which achieved 90.8% accuracy on text-based information.…”
Section: Cgpts With 'Knowledge'mentioning
confidence: 99%
“…Off-the-shelf MLLMs, such as GPT4v, are not well-trained on pathology images, so are not very suitable for analyzing histopathological images. In one study involving 100 colorectal polyp photomicrographs, ChatGPT achieved a sensitivity of 74% and specificity of 36% in adenoma detection [87]. Sievert et al trained ChatGPT with 16 oropharyngeal confocal laser endomicroscopy images (8 with squamous cell carcinoma, 8 with normal mucosa) and it was tested with 139 images (83 with squamous cell carcinoma and 56 with healthy normal mucosa), achieving an overall accuracy of 71%, demonstrating an ability for fewshot learning.…”
Section: Multi-modal Large Language Modelsmentioning
confidence: 99%