Objectives
The purpose of this study is to inquire about the potential association between radiomics features and the pathological nature of thyroid nodules (TNs), and to propose an interpretable radiomics-based model for predicting the risk of malignant TN.
Methods
In this retrospective study, computed tomography (CT) imaging and pathological data from 141 patients with TN were collected. The data were randomly stratified into a training group (n = 112) and a validation group (n = 29) at a ratio of 4:1. A total of 1316 radiomics features were extracted by using the pyradiomics tool. The redundant features were removed through correlation testing, and the least absolute shrinkage and selection operator (LASSO) or the minimum redundancy maximum relevance standard was used to select features. Finally, 4 different machine learning models (RF Hybrid Feature, SVM Hybrid Feature, RF, and LASSO) were constructed. The performance of the 4 models was evaluated using the receiver operating characteristic curve. The calibration curve, decision curve analysis, and SHapley Additive exPlanations method were used to evaluate or explain the best radiomics machine learning model.
Results
The optimal radiomics model (RF Hybrid Feature model) demonstrated a relatively high degree of discrimination with an area under the receiver operating characteristic curve (AUC) of 0.87 (95% CI, 0.70–0.97; P < 0.001) for the validation cohort. Compared with the commonly used LASSO model (AUC, 0.78; 95% CI, 0.60–0.91; P < 0.01), there is a significant improvement in AUC in the validation set, net reclassification improvement, 0.79 (95% CI, 0.13–1.46; P < 0.05), and integrated discrimination improvement, 0. 20 (95% CI, 0.10–0.30; P < 0.001).
Conclusion
The interpretable radiomics model based on CT performs well in predicting benign and malignant TNs by using quantitative radiomics features of the unilateral total thyroid. In addition, the data preprocessing method incorporating different layers of features has achieved excellent experimental results.
Clinical relevance statement
As the detection rate of TNs continues to increase, so does the diagnostic burden on radiologists. This study establishes a noninvasive, interpretable and accurate machine learning model to rapidly identify the nature of TN found in CT.