Constructing a finer-grained representation of clinical trial results from ClinicalTrials.gov

Shi, Xuanyu; Du, Jian

doi:10.1038/s41597-023-02869-7

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Preprint1

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Emerging technologies for drug repurposing: Harnessing the potential of text and graph embedding approaches

Dong,

Zheng

2024

Artificial Intelligence Chemistry

View full text Add to dashboard Cite

Emerging technologies for drug repurposing: Harnessing the potential of text and graph embedding approaches

Dong,

Zheng

2024

Artificial Intelligence Chemistry

View full text Add to dashboard Cite

A dataset for evaluating clinical research claims in large language models

Zhang,

Yazdani,

Bornet

et al. 2024

Preprint

View full text Add to dashboard Cite

Large language models (LLMs) have the potential to enhance the verification of health claims. However, issues with hallucination and comprehension of logical statements require these models to be closely scrutinized in healthcare applications. We introduce CliniFact, a scientific claim dataset created from hypothesis testing results in clinical research, covering 992 unique interventions for 22 disease categories. The dataset used study arms and interventions, primary outcome measures, and results from clinical trials to derive and label clinical research claims. These claims were then linked to supporting information describing clinical trial results in scientific publications. CliniFact contains 1,970 scientific claims from 992 unique clinical trials related to 1,540 unique publications. Intrinsic evaluation yields a Cohen's Kappa score of 0.83, indicating strong inter-annotator agreement. In extrinsic evaluations, discriminative LLMs, such as PubMedBERT, achieved 81% accuracy and 79% F1-score, outperforming generative LLMs, such as Llama3-70B, which reached 52% accuracy and 39% F1-score. Our results demonstrate the potential of CliniFact as a benchmark for evaluating LLM performance in clinical research claim verification.

show abstract

Constructing a finer-grained representation of clinical trial results from ClinicalTrials.gov

Cited by 2 publications

References 26 publications

Emerging technologies for drug repurposing: Harnessing the potential of text and graph embedding approaches

Emerging technologies for drug repurposing: Harnessing the potential of text and graph embedding approaches

A dataset for evaluating clinical research claims in large language models

Contact Info

Product

Resources

About