2021
DOI: 10.1186/s12911-021-01492-z
|View full text |Cite
|
Sign up to set email alerts
|

Automated classification of clinical trial eligibility criteria text based on ensemble learning and metric learning

Abstract: Background Eligibility criteria are the primary strategy for screening the target participants of a clinical trial. Automated classification of clinical trial eligibility criteria text by using machine learning methods improves recruitment efficiency to reduce the cost of clinical research. However, existing methods suffer from poor classification performance due to the complexity and imbalance of eligibility criteria text data. Methods An ensemble… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 28 publications
0
4
0
1
Order By: Relevance
“…We can only use the truncation method to preprocess text, that is, first truncation, tail truncation, and head to tail truncation, which adds some difficulty to the preliminary work. According to the work of Zeng et al, the base model did improve the accuracy rate by adjusting the downstream tasks [30]. For the single model, XLNET and RoBERTa were better than BERT and ERNIE, and the integration of multiple models will improve the model by 2.58% on average.…”
Section: Discussionmentioning
confidence: 99%
“…We can only use the truncation method to preprocess text, that is, first truncation, tail truncation, and head to tail truncation, which adds some difficulty to the preliminary work. According to the work of Zeng et al, the base model did improve the accuracy rate by adjusting the downstream tasks [30]. For the single model, XLNET and RoBERTa were better than BERT and ERNIE, and the integration of multiple models will improve the model by 2.58% on average.…”
Section: Discussionmentioning
confidence: 99%
“…In this work, we implemented an ensemble learning framework for miRNA-disease association prediction. Inspired by the previous research ( Chen et al, 2019b ; Dai et al, 2020 ; Sherazi et al, 2021 ; Wang et al, 2021 ; Zeng et al, 2021 ), we built the CSMDA through the following three stages: 1) construct multiple training subsets to increase the diversity of base classifiers by randomly sampling from ; 2) perform the random forest-based feature selection to reduce noise and redundant information in the high-dimensional feature space; 3) use soft voting strategy to integrate the prediction results of all base classifiers. The process of constructing the ensemble learning framework is shown in Figure 3 .…”
Section: Methodsmentioning
confidence: 99%
“…Here, let represent the number of training subsets. Take an unknown miRNA-disease pair as sample input, m base classifiers could produce m prediction result for the sample, and then the prediction results were integrated by the soft voting strategy ( Sherazi et al, 2021 ; Wang et al, 2021 ; Zeng et al, 2021 ). Specifically, the output of the th sample by soft voting was defined as follows: …”
Section: Methodsmentioning
confidence: 99%
“…Kriteria kelayakan adalah strategi utama untuk melakukan penyaringan target dalam sebuah uji klinis. Klasifikasi otomatis teks kriteria kelayakan uji klinis dengan menggunakan metode pembelajaran mesin meningkatkan efisiensi rekrutmen untuk mengurangi biaya riset klinis [12]. Karena persyaratan kelayakan untuk uji klinis biasanya ditulis dalam teks bebas, pemrosesannya memerlukan interpretasi komputer [13] Beberapa penelitian telah dilakukan oleh para peneliti dalam mengklasifikasikan data klinis kanker.…”
Section: Pendahuluanunclassified