2023
DOI: 10.1038/s41598-023-28579-z
|View full text |Cite
|
Sign up to set email alerts
|

Potential and limitations of machine meta-learning (ensemble) methods for predicting COVID-19 mortality in a large inhospital Brazilian dataset

Abstract: The majority of early prediction scores and methods to predict COVID-19 mortality are bound by methodological flaws and technological limitations (e.g., the use of a single prediction model). Our aim is to provide a thorough comparative study that tackles those methodological issues, considering multiple techniques to build mortality prediction models, including modern machine learning (neural) algorithms and traditional statistical techniques, as well as meta-learning (ensemble) approaches. This study used a … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…While the above-mentioned techniques for reducing the number of variables can eliminate redundant and irrelevant features, de Paiva, Pereira, and de Andrade argue that it is not always clear whether these methods result in improvements in the predictive power of ML models [65]. Furthermore, as these methods project the features to a new dimension and the features in the new dimension become mixed features, these new features might not necessarily provide a strong explanatory basis [66].…”
Section: Methods [Source] Explanationmentioning
confidence: 99%
“…While the above-mentioned techniques for reducing the number of variables can eliminate redundant and irrelevant features, de Paiva, Pereira, and de Andrade argue that it is not always clear whether these methods result in improvements in the predictive power of ML models [65]. Furthermore, as these methods project the features to a new dimension and the features in the new dimension become mixed features, these new features might not necessarily provide a strong explanatory basis [66].…”
Section: Methods [Source] Explanationmentioning
confidence: 99%
“…Ensemble learning, which combines the predictive abilities of two or more base learner models to reduce bias and variance, thereby improving overall prediction performance, has seen recent application in analyzing COVID-19 data. However, determining the optimal combination of models for optimal performance remains challenging, with most researchers manually constructing stacking ensemble models to predict COVID-19's clinical severity [32][33][34][35]. Additionally, some studies opted for simplified severity classifications related only to survival and death, rather than attempting to classify a diverse range of clinical severities [36,37].…”
Section: Related Workmentioning
confidence: 99%
“…When this occurs, the boosting algorithm may ignore the vast majority of the training data in favor of the few misclassified observations. As a result, general accuracy in classification tends to drop (17) . In such situation, good solution is using robust boosting (RobustBoost).…”
Section: Classifiermentioning
confidence: 99%