Reproducible radiomics through automated machine learning validated on twelve clinical applications

Starmans, Martijn P.A.; Voort, Sebastian R. van der; Phil, Thomas; Timbergen, Milea J M; Vos, Melissa; Padmos, Guillaume A.; Kessels, Wouter; Hanff, David; Grünhagen, Dirk J.; Verhoef, Cornelis; Sleijfer, Stefan; Bent, Martin J. van den; Smits, Marion; Dwarkasing, Roy S.; Els, Christopher J.; Federico, Fiduzi,; Leenders, Geert J.L.H. van; Blažević, Anela; Hofland, Johannes; Brabander, Tessa; Gils, Renza A. H. van; Franssen, Gaston J H; Feelders, Richard A; Herder, Wouter W. de; Buisman, Florian E.; Willemssen, Francois E. J. A.; Koerkamp, B. Groot; Angus, Lindsay; Veldt, Astrid A.M. van der; Rajicic, Ana; Odink, Arlette E.; Deen, M. Jamal; Veenland, Jifke F.; Schoots, Ivo G.; Renckens, Michel; Doukas, Michail; Man, Rob A. de; IJzermans, Jan N. M.; Miclea, Razvan L.; Vermeulen, Peter; Bron, Esther E.; Thomeer, Maarten; Visser, Jacob J.; Niessen, Wiro J.; Klein, Stefan

doi:10.48550/arxiv.2108.08618

Cited by 7 publications

(15 citation statements)

References 62 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In addition, not using explicit validation data might give rise to a positive bias, even though nested cross-validation should give a relatively unbiased estimation, if the external data follows the same distribution [13]. Comparing our AUC-ROCs to those of Starmans et al [28], it is striking that using the original set of features on four datasets (Desmoid, GIST, Lipo, Liver), the performance is slightly below the 95% CI reported there. For using all features, only in one case (Liver) was the performance lower, while no difference could be seen when using the best feature set.…”

Section: Discussioncontrasting

confidence: 55%

The effect of preprocessing filters on predictive performance in radiomics

Demircioğlu

2022

Eur Radiol Exp

View full text Add to dashboard Cite

Background Radiomics is a noninvasive method using machine learning to support personalised medicine. Preprocessing filters such as wavelet and Laplacian-of-Gaussian filters are commonly used being thought to increase predictive performance. However, the use of preprocessing filters increases the number of features by up to an order of magnitude and can produce many correlated features. Both substantially increase the dataset complexity, which in turn makes modeling with machine learning techniques more challenging, possibly leading to poorer performance. We investigated the impact of these filters on predictive performance. Methods Using seven publicly available radiomic datasets, we measured the impact of adding features preprocessed with eight different preprocessing filters to the unprocessed features on the predictive performance of radiomic models. Modeling was performed using five feature selection methods and five classifiers, while predictive performance was measured using area-under-the-curve at receiver operating characteristics analysis (AUC-ROC) with nested, stratified 10-fold cross-validation. Results Significant improvements of up to 0.08 in AUC-ROC were observed when all image preprocessing filters were applied compared to using only the original features (up to p = 0.024). Decreases of -0.04 and -0.10 were observed on some data sets, but these were not statistically significant (p > 0.179). Tuning of the image preprocessing filters did not result in decreases in AUC-ROC but further improved results by up to 0.1; however, these improvements were not statistically significant (p > 0.086) except for one data set (p = 0.023). Conclusions Preprocessing filters can have a significant impact on the predictive performance and should be used in radiomic studies.

show abstract

Section: Discussioncontrasting

confidence: 55%

The effect of preprocessing filters on predictive performance in radiomics

Demircioğlu

2022

Eur Radiol Exp

View full text Add to dashboard Cite

show abstract

“…Using a single acquisition protocol could improve the performance unaffected by such variations, but it is not always feasible in a multicenter setting and limits the applicability. As the WORC method has previously successfully been used in similar settings [ 35 , 36 ], we do not expect that the poor performance can be explained by the variations in image acquisition alone.…”

Section: Discussionmentioning

confidence: 99%

“…The WORC radiomics method applied in this study has been previously validated in a variety of clinical applications [ 35 , 36 ]. In eleven of the twelve previous studies, the radiomics models had a better performance (mean AUCs between 0.68 and 0.94), and multiple features showed differences in univariate statistical testing, e.g., [ 39 , 44 , 45 , 46 ].…”

Section: Discussionmentioning

confidence: 99%

“…Therefore, further research is needed to investigate the use of various MRI protocols (e.g., T2-weighted, dynamic-contrast-enhanced and diffusion-weighted MRI [ 62 ],) for the detection of lymph nodes in patients with MIBC, as well as the relatively novel PET/MRI approach [ 63 ]. Since our radiomics approach has previously been proven successful for MRI [ 35 , 36 ], the same approach could be evaluated to differentiate pN+ and pN0 disease based on MRI.…”

Section: Discussionmentioning

confidence: 99%

“…The radiomics analysis were performed using the Workflow for Optimal Radiomics Classification (WORC) toolbox [ 35 , 36 ]; an overview of the radiomics methodology is depicted in Figure 1 . For each ROI (e.g., per lymph node or primary tumor segmentation), 564 features quantifying intensity, shape and texture were extracted from the CT scan.…”

Section: Methodsmentioning

confidence: 99%

See 2 more Smart Citations

Optimization of Preoperative Lymph Node Staging in Patients with Muscle-Invasive Bladder Cancer Using Radiomics on Computed Tomography

Starmans

Smits

et al. 2022

JPM

Self Cite

View full text Add to dashboard Cite

Approximately 25% of the patients with muscle-invasive bladder cancer (MIBC) who are clinically node negative have occult lymph node metastases at radical cystectomy (RC) and pelvic lymph node dissection. The aim of this study was to evaluate preoperative CT-based radiomics to differentiate between pN+ and pN0 disease in patients with clinical stage cT2-T4aN0-N1M0 MIBC. Patients with cT2-T4aN0-N1M0 MIBC, of whom preoperative CT scans and pathology reports were available, were included from the prospective, multicenter CirGuidance trial. After manual segmentation of the lymph nodes, 564 radiomics features were extracted. A combination of different machine-learning methods was used to develop various decision models to differentiate between patients with pN+ and pN0 disease. A total of 209 patients (159 pN0; 50 pN+) were included, with a total of 3153 segmented lymph nodes. None of the individual radiomics features showed significant differences between pN+ and pN0 disease, and none of the radiomics models performed substantially better than random guessing. Hence, CT-based radiomics does not contribute to differentiation between pN+ and pN0 disease in patients with cT2-T4aN0-N1M0 MIBC.

show abstract

CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients

Bernatz

Elenberger

Ackermann

et al. 2023

Sci Rep

View full text Add to dashboard Cite

We aimed to identify hepatocellular carcinoma (HCC) patients who will respond to repetitive transarterial chemoembolization (TACE) to improve the treatment algorithm. Retrospectively, 61 patients (mean age, 65.3 years ± 10.0 [SD]; 49 men) with 94 HCC mRECIST target-lesions who had three consecutive TACE between 01/2012 and 01/2020 were included. Robust and non-redundant radiomics features were extracted from the 24 h post-embolization CT. Five different clinical TACE-scores were assessed. Seven different feature selection methods and machine learning models were used. Radiomics, clinical and combined models were built to predict response to TACE on a lesion-wise and patient-wise level as well as its impact on overall-survival prognostication. 29 target-lesions of 19 patients were evaluated in the test set. Response rates were 37.9% (11/29) on the lesion-level and 42.1% (8/19) on the patient-level. Radiomics top lesion-wise response prognostications was AUC 0.55–0.67. Clinical scores revealed top AUCs of 0.65–0.69. The best working model combined the radiomic feature LargeDependenceHighGrayLevelEmphasis and the clinical score mHAP_II_score_group with AUC = 0.70, accuracy = 0.72. We transferred this model on a patient-level to achieve AUC = 0.62, CI = 0.41–0.83. The two radiomics-clinical features revealed overall-survival prognostication of C-index = 0.67. In conclusion, a random forest model using the radiomic feature LargeDependenceHighGrayLevelEmphasis and the clinical mHAP-II-score-group seems promising for TACE response prognostication.

show abstract

Reproducible radiomics through automated machine learning validated on twelve clinical applications

Cited by 7 publications

References 62 publications

The effect of preprocessing filters on predictive performance in radiomics

The effect of preprocessing filters on predictive performance in radiomics

Optimization of Preoperative Lymph Node Staging in Patients with Muscle-Invasive Bladder Cancer Using Radiomics on Computed Tomography

CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients

Contact Info

Product

Resources

About