(1) Background: Chest Computed Tomography (CT) has been proposed as a non-invasive method for confirming the diagnosis of SARS-CoV-2 patients using radiomic features (RFs) and baseline clinical data. The performance of Machine Learning (ML) methods using RFs derived from semi-automatically segmented lungs in chest CT images was investigated regarding the ability to predict the mortality of SARS-CoV-2 patients. (2) Methods: A total of 179 RFs extracted from 436 chest CT images of SARS-CoV-2 patients, and 8 clinical and 6 radiological variables, were used to train and evaluate three ML methods (Least Absolute Shrinkage and Selection Operator [LASSO] regularized regression, Random Forest Classifier [RFC], and the Fully connected Neural Network [FcNN]) for their ability to predict mortality using the Area Under the Curve (AUC) of Receiver Operator characteristic (ROC) Curves. These three groups of variables were used separately and together as input for constructing and comparing the final performance of ML models. (3) Results: All the ML models using only RFs achieved an informative level regarding predictive ability, outperforming radiological assessment, without however reaching the performance obtained with ML based on clinical variables. The LASSO regularized regression and the FcNN performed equally, both being superior to the RFC. (4) Conclusions: Radiomic features based on semi-automatically segmented CT images and ML approaches can aid in identifying patients with a high risk of mortality, allowing a fast, objective, and generalizable method for improving prognostic assessment by providing a second expert opinion that outperforms human evaluation.
Background: Few studies have focused on predicting the overall survival (OS) of patients affected by SARS-CoV-2 (i.e., COVID-19) using radiomic features (RFs) extracted from computer tomography (CT) images. Reconstruction of CT scans might potentially affect the values of RFs. Methods: Out of 435 patients, 239 had the scans reconstructed with a single modality, and hence, were used for training/testing, and 196 were reconstructed with two modalities were used as validation to evaluate RFs robustness to reconstruction. During training, the dataset was split into train/test using a 70/30 proportion, randomizing the procedure 100 times to obtain 100 different models. In all cases, RFs were normalized using the z-score and then given as input into a Cox proportional-hazards model regularized with the Least Absolute Shrinkage and Selection Operator (LASSO-Cox), used for feature selection and developing a robust model. The RFs retained multiple times in the models were also included in a final LASSO-Cox for developing the predictive model. Thus, we conducted sensitivity analysis increasing the number of retained RFs with an occurrence cut-off from 11% to 60%. The Bayesian information criterion (BIC) was used to identify the cut-off to build the optimal model. Results: The best BIC value indicated 45% as the optimal occurrence cut-off, resulting in five RFs used for generating the final LASSO-Cox. All the Kaplan-Meier curves of training and validation datasets were statistically significant in identifying patients with good and poor prognoses, irrespective of CT reconstruction. Conclusions: The final LASSO-Cox model maintained its predictive ability for predicting the OS in COVID-19 patients irrespective of CT reconstruction algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.