2023
DOI: 10.2196/47737

Performance of ChatGPT on UK Standardized Admission Tests: Insights From the BMAT, TMUA, LNAT, and TSA Examinations

Abstract: Background: Large language models, such as ChatGPT by OpenAI, have demonstrated potential in various applications, including medical education. Previous studies have assessed ChatGPT’s performance in university or professional settings. However, the model’s potential in the context of standardized admission tests remains unexplored. Objective: This study evaluated ChatGPT’s performance on standardized admission tests in the United Kingdom, including the B…

Cited by 60 publications (31 citation statements)
References 7 publications
“…Yet, the AI model struggles with more complex tasks requiring advanced comprehension, analytical abilities, and precise calculations. As indicated by a number of studies, 16,[20][21][22] ChatGPT's limitations in handling scientific and mathematical applications, particularly those demanding high-level cognitive engagement, become evident. Fluctuations in accuracy may be linked to the nature of subfield questions, even without explicit categorization.…”
Section: Discussion (mentioning)
confidence: 99%
“…However, reported rates of correct answers vary dramatically across different examinations and medical fields. 3,4 We aimed to conduct a meta-analysis of studies reporting ChatGPT's performance in medical examinations with multiple-choice questions.…”
Section: Objective (mentioning)
confidence: 99%
“…ChatGPT's performance in different medical knowledge examinations has been recently studied in various medical disciplines. However, reported rates of correct answers vary dramatically across different examinations and medical fields 3,4 . We aimed to conduct a meta‐analysis of studies reporting ChatGPT's performance in medical examinations with multiple‐choice questions.…”
Section: Objective (mentioning)
confidence: 99%
“…One prominent illustration of this is the Generative Pre-Trained Transformer (GPT), released by Open AI in 2018 [1]. GPT 4.0 has proven remarkable ability in assessing knowledge in specialised domains such as medicine, law, and business [2][3][4]-areas that have historically been the exclusive purview of professionals. Particularly noteworthy is its exceptional performance on assessments like the Korean general surgery board exam, the United States Medical Licensing Exam, and the Wharton MBA final exam, each achieved without the finetuning of the pretrained model [5][6][7].…”
Section: Introduction (mentioning)
confidence: 99%