COVID-19 clinical presentation and prognosis are highly variable, ranging from asymptomatic and paucisymptomatic cases to acute respiratory distress syndrome and multi-organ involvement. We developed a hybrid machine learning/deep learning model to classify patients in two outcome categories, non-ICU and ICU (intensive care admission or death), using 558 patients admitted in a northern Italy hospital in February/May of 2020. A fully 3D patient-level CNN classifier on baseline CT images is used as feature extractor. Features extracted, alongside with laboratory and clinical data, are fed for selection in a Boruta algorithm with SHAP game theoretical values. A classifier is built on the reduced feature space using CatBoost gradient boosting algorithm and reaching a probabilistic AUC of 0.949 on holdout test set. The model aims to provide clinical decision support to medical doctors, with the probability score of belonging to an outcome class and with case-based SHAP interpretation of features importance.
The coronavirus disease 2019 (COVID-19) pandemic is the most severe global health and socioeconomic crisis of our time, and represents the greatest challenge faced by the world since the end of the Second World War. The academic literature indicates that climatic features, specifically temperature and absolute humidity, are very important factors affecting infectious pulmonary disease epidemics - such as severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS); however, the influence of climatic parameters on COVID-19 remains extremely controversial. The goal of this study is to individuate relationships between several climate parameters (temperature, relative humidity, accumulated precipitation, solar radiation, evaporation, and wind direction and intensity), local morphological parameters, and new daily positive swabs for COVID-19, which represents the only parameter that can be statistically used to quantify the pandemic. The daily deaths parameter was not considered, because it is not reliable, due to frequent administrative errors. Daily data on meteorological conditions and new cases of COVID-19 were collected for the Lombardy Region (Northern Italy) from 1 March, 2020 to 20 April, 2020. This region exhibited the largest rate of official deaths in the world, with a value of approximately 1700 per million on 30 June 2020. Moreover, the apparent lethality was approximately 17% in this area, mainly due to the considerable housing density and the extensive presence of industrial and craft areas. Both the Mann–Kendall test and multivariate statistical analysis showed that none of the considered climatic variables exhibited statistically significant relationships with the epidemiological evolution of COVID-19, at least during spring months in temperate subcontinental climate areas, with the exception of solar radiation, which was directly related and showed an otherwise low explained variability of approximately 20%. Furthermore, the average temperatures of two highly representative meteorological stations of Molise and Lucania (Southern Italy), the most weakly affected by the pandemic, were approximately 1.5 °C lower than those in Bergamo and Brescia (Lombardy), again confirming that a significant relationship between the increase in temperature and decrease in virulence from COVID-19 is not evident, at least in Italy.
The coronavirus disease 2019 (COVID-19) pandemic is the defining global health and socioeconomic crisis of our time and represents the greatest challenge faced by the world since the end of the Second World War. The academic literature indicates that climatic features, specifically the temperature and absolute humidity, are very important factors affecting infectious pulmonary disease epidemics (e.g., SARS, MERS); however, the influence of climatic parameters on COVID-19 remains extremely controversial. The goal of this study is to quantify the existing relationship between several daily climate parameters (temperature, relative humidity, accumulated precipitation, solar radiation, wind direction and intensity, and evaporation), local morphological parameters, and new daily positive swabs for COVID-19, which represents the only parameter that can be statistically used to quantify the pandemic. The daily deaths parameter was not considered because it is not reliable due to frequent administrative errors. Daily data on meteorological conditions and new cases of COVID-19 were collected for the Lombardy area from March 1, 2020, to April 20, 2020. This region in Italy exhibited the largest number of official deaths in the world per million inhabitants, with a value of approximately 1700 per million on june 30, 2020. Moreover, the apparent lethality was approximately 17% in this area, mainly due to the considerable housing density and the extensive presence of industrial and craft areas. The Mann-Kendall test and multivariate statistical analysis showed that none of the considered climatic variables exhibited statistically significant relationships with the epidemiological evolution of COVID-19, at least in the spring months in temperate subcontinental climate areas, with the exception of solar radiation, which was directly related and showed an otherwise low explained variability of approximately 20%. Furthermore, the average temperatures of two highly representative meteorological stations of Molise and Lucania, the most weakly affected by the pandemic. The temperatures at these stations were approximately 1.5°C lower than that in the cities in Lombardy of Bergamo and Brescia, again confirming that a significant relationship between the increase in temperature and decrease in virology from COVID-19 was not evident, at least in the Italian peninsula.
COVID-19 clinical presentation and prognosis are highly variable, ranging from asymptomatic and paucisymptomatic cases to acute respiratory distress syndrome and multi-organ involvement. We developed a hybrid machine learning/deep learning model to classify patients in two outcome categories, non-ICU and ICU (intensive care admission or death), using 558 patients admitted in a northern Italy hospital in February/May of 2020. A fully 3D patient-level CNN classifier on baseline CT images is used as feature extractor. Features extracted, alongside with laboratory and clinical data, are fed for selection in a Boruta algorithm with SHAP game theoretical values. A classifier is built on the reduced feature space using CatBoost gradient boosting algorithm and reaching a probabilistic AUC of 0.949 on holdout test set. The model aims to provide clinical decision support to medical doctors, with the probability score of belonging to an outcome class and with case-based SHAP interpretation of features importance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.