2023
DOI: 10.3390/systems11070351
|View full text |Cite
|
Sign up to set email alerts
|

Harnessing the Power of ChatGPT for Automating Systematic Review Process: Methodology, Case Study, Limitations, and Future Directions

Abstract: Systematic reviews (SR) are crucial in synthesizing and analyzing existing scientific literature to inform evidence-based decision-making. However, traditional SR methods often have limitations, including a lack of automation and decision support, resulting in time-consuming and error-prone reviews. To address these limitations and drive the field forward, we harness the power of the revolutionary language model, ChatGPT, which has demonstrated remarkable capabilities in various scientific writing tasks. By ut… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 74 publications
(49 citation statements)
references
References 51 publications
0
23
0
Order By: Relevance
“…First, we prompted the LLM GPT-4 to answer specific questions about entire papers whereas previous studies were often optimized for screening paper title and abstracts (5)(6)(7)(8). Second, we used GPT without providing training examples, whereas previous studies, were often interactive in that they combined NLP and ML algorithms with user feedback (5,6,(9)(10)(11). Finally, our results were quantitative and transparent, whereas several previous studies, particularly those using LLMs, presented their results in a qualitative manner.…”
Section: Discussionmentioning
confidence: 94%
See 1 more Smart Citation
“…First, we prompted the LLM GPT-4 to answer specific questions about entire papers whereas previous studies were often optimized for screening paper title and abstracts (5)(6)(7)(8). Second, we used GPT without providing training examples, whereas previous studies, were often interactive in that they combined NLP and ML algorithms with user feedback (5,6,(9)(10)(11). Finally, our results were quantitative and transparent, whereas several previous studies, particularly those using LLMs, presented their results in a qualitative manner.…”
Section: Discussionmentioning
confidence: 94%
“…Most such tools have used natural language processing (NLP) and machine learning (ML) algorithms primarily to screen the titles and abstracts of publications to determine whether they meet the search criteria for a systematic review (2)(3)(4)(5)(6)(7)(8). Several studies have also described the potential for using the representational language model BERT (Bidirectional Encoder Representations from Transformers) and the Generative Pre-trained Transformer (GPT) large language models (LLMs) for reviewing the full text of published studies (9)(10)(11)(12). LLMs have also been evaluated for their ability to summarize research studies (13,14).…”
Section: Introductionmentioning
confidence: 99%
“…[ 96–99 ] While they are not designed to replace human experts with specialized domain knowledge, these models could significantly streamline and simplify the process of conducting literature reviews. [ 98,100 ] This could become an invaluable tool for researchers trying to understand a field and identify emerging trends and key discoveries within it.…”
Section: The Third Rung: Generative Models and Hypothesis Generationmentioning
confidence: 99%
“…There are some initial attempts to evaluate ChatGPTs in automated SR, such as automating search queries [49]. Alshami et al [50] leveraged GPT to automate the SR pipeline, yet their methodology diverges from conventional abstract screening practices, rendering it distinct from traditional approaches. The application of GPT in abstract screening has been scarcely explored.…”
Section: Background Studymentioning
confidence: 99%