An Interpretable Machine Learning Framework for Accurate Severe vs Non-severe COVID-19 Clinical Type Classification

Chen, Yuanfang; Ouyang, Liu; Bao, Sheng; Li, Qian; Han, Lei; Zhang, Hengdong; Zhu, Baoli; Xu, Mengyi; Liu, Jie; Ge, Yaorong; Chen, Shi

doi:10.1101/2020.05.18.20105841

Cited by 22 publications

(30 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Prediction models of the prognosis for a given disease have the main objective of supporting the physician's decision-making about what is the best measure of patient referral, assisting in the screening of patients at high risk of progressing to severe disease. Artificial intelligence models aiming to identify risk factors for prognostic prediction of severe COVID-19 have been developed using age, clinical characteristics, laboratory tests and chest imaging [30,31,32,29,33,34].…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Predicting the disease outcome in COVID-19 positive patients through Machine Learning: a retrospective cohort study with Brazilian data

Souza

Hojo-Souza

Santos

et al. 2020

Preprint

View full text Add to dashboard Cite

The first officially registered case of COVID-19 in Brazil was on February 26, 2020. Since then, the situation has worsened with more than 672,000 confirmed cases and at least 36,000 reported deaths at the time of this writing. Accurate diagnosis of patients with COVID-19 is extremely important to offer adequate treatment, and avoid overloading the healthcare system. Characteristics of patients such as age, comorbidities and varied clinical symptoms can help in classifying the level of infection severity, predict the disease outcome and the need for hospitalization. Here, we present a study to predict a poor prognosis in positive COVID-19 patients and possible outcomes using machine learning. The study dataset comprises information of 13,690 patients concerning closed cases due to cure or death. Our experimental results show the disease outcome can be predicted with a ROC AUC of 0.92, Sensitivity of 0.88 and Specificity of 0.82 for the best prediction model. This is a preliminary retrospective study which can be improved with the inclusion of further data. Conclusion: Machine learning techniques fed with demographic and clinical data along with comorbidities of the patients can assist in the prognostic prediction and physician decision-making, allowing a faster response and contributing to the non-overload of healthcare systems.

show abstract

Section: Discussionmentioning

confidence: 99%

“…The addition of biochemical data to symptoms/comorbidities achieved > 99% predictive accuracy. Therefore, it was suggested that symptoms and comorbidities can be used in an initial screening and the biochemical data inclusion could predict the severity degree and assist in the development of treatment plans [29].…”

Section: Discussionmentioning

confidence: 99%

Predicting the disease outcome in COVID-19 positive patients through Machine Learning: a retrospective cohort study with Brazilian data

Souza

Hojo-Souza

Santos

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…An advantage that the RF model had over SVM and kNN models was that it had relatively clearer interpretability, especially when interpreting feature importance. After developing the RF model based on the training set, we were able to rank the importance of input features based on their corresponding Gini impurity score from the RF model [ 40 , 41 ]. It should be noted that only the training set was used to compute Gini impurity, not the test set.…”

Section: Methodsmentioning

confidence: 99%

Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach

Xu¹,

Ouyang²,

Han³

et al. 2021

J Med Internet Res

Self Cite

View full text Add to dashboard Cite

Background Effectively identifying patients with COVID-19 using nonpolymerase chain reaction biomedical data is critical for achieving optimal clinical outcomes. Currently, there is a lack of comprehensive understanding in various biomedical features and appropriate analytical approaches for enabling the early detection and effective diagnosis of patients with COVID-19. Objective We aimed to combine low-dimensional clinical and lab testing data, as well as high-dimensional computed tomography (CT) imaging data, to accurately differentiate between healthy individuals, patients with COVID-19, and patients with non-COVID viral pneumonia, especially at the early stage of infection. Methods In this study, we recruited 214 patients with nonsevere COVID-19, 148 patients with severe COVID-19, 198 noninfected healthy participants, and 129 patients with non-COVID viral pneumonia. The participants’ clinical information (ie, 23 features), lab testing results (ie, 10 features), and CT scans upon admission were acquired and used as 3 input feature modalities. To enable the late fusion of multimodal features, we constructed a deep learning model to extract a 10-feature high-level representation of CT scans. We then developed 3 machine learning models (ie, k-nearest neighbor, random forest, and support vector machine models) based on the combined 43 features from all 3 modalities to differentiate between the following 4 classes: nonsevere, severe, healthy, and viral pneumonia. Results Multimodal features provided substantial performance gain from the use of any single feature modality. All 3 machine learning models had high overall prediction accuracy (95.4%-97.7%) and high class-specific prediction accuracy (90.6%-99.9%). Conclusions Compared to the existing binary classification benchmarks that are often focused on single-feature modality, this study’s hybrid deep learning-machine learning framework provided a novel and effective breakthrough for clinical applications. Our findings, which come from a relatively large sample size, and analytical workflow will supplement and assist with clinical decision support for current COVID-19 diagnostic methods and other clinical applications with high-dimensional multimodal biomedical features.

show abstract

“…The authors use machine learning algorithms to provide assay design for detection of 67 viral species and subspecies of SARS-CoV-2. In [61] , random forest models are used to classify the covid-19 patients.…”

Section: Clinical Applicationsmentioning

confidence: 99%

Applications of artificial intelligence in battling against covid-19: A literature review

2021

Chaos, Solitons & Fractals

161

View full text Add to dashboard Cite

show abstract

An Interpretable Machine Learning Framework for Accurate Severe vs Non-severe COVID-19 Clinical Type Classification

Cited by 22 publications

References 36 publications

Predicting the disease outcome in COVID-19 positive patients through Machine Learning: a retrospective cohort study with Brazilian data

Predicting the disease outcome in COVID-19 positive patients through Machine Learning: a retrospective cohort study with Brazilian data

Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach

Applications of artificial intelligence in battling against covid-19: A literature review

Contact Info

Product

Resources

About