2021
DOI: 10.48550/arxiv.2108.08618
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reproducible radiomics through automated machine learning validated on twelve clinical applications

Abstract: Radiomics uses quantitative medical imaging features to predict clinical outcomes. While many radiomics methods have been described in the literature, these are generally designed for a single application. The aim of this study is to generalize radiomics across applications by proposing a framework to automatically construct and optimize the radiomics workflow per application. To this end, we formulate radiomics as a modular workflow, consisting of several components: image and segmentation preprocessing, feat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
12
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(15 citation statements)
references
References 62 publications
2
12
1
Order By: Relevance
“…In addition, not using explicit validation data might give rise to a positive bias, even though nested cross-validation should give a relatively unbiased estimation, if the external data follows the same distribution [13]. Comparing our AUC-ROCs to those of Starmans et al [28], it is striking that using the original set of features on four datasets (Desmoid, GIST, Lipo, Liver), the performance is slightly below the 95% CI reported there. For using all features, only in one case (Liver) was the performance lower, while no difference could be seen when using the best feature set.…”
Section: Discussioncontrasting
confidence: 55%
“…In addition, not using explicit validation data might give rise to a positive bias, even though nested cross-validation should give a relatively unbiased estimation, if the external data follows the same distribution [13]. Comparing our AUC-ROCs to those of Starmans et al [28], it is striking that using the original set of features on four datasets (Desmoid, GIST, Lipo, Liver), the performance is slightly below the 95% CI reported there. For using all features, only in one case (Liver) was the performance lower, while no difference could be seen when using the best feature set.…”
Section: Discussioncontrasting
confidence: 55%
“…Using a single acquisition protocol could improve the performance unaffected by such variations, but it is not always feasible in a multicenter setting and limits the applicability. As the WORC method has previously successfully been used in similar settings [ 35 , 36 ], we do not expect that the poor performance can be explained by the variations in image acquisition alone.…”
Section: Discussionmentioning
confidence: 99%
“…The WORC radiomics method applied in this study has been previously validated in a variety of clinical applications [ 35 , 36 ]. In eleven of the twelve previous studies, the radiomics models had a better performance (mean AUCs between 0.68 and 0.94), and multiple features showed differences in univariate statistical testing, e.g., [ 39 , 44 , 45 , 46 ].…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations