2021
DOI: 10.1007/s10618-020-00731-7
|View full text |Cite
|
Sign up to set email alerts
|

Extending greedy feature selection algorithms to multiple solutions

Abstract: Most feature selection methods identify only a single solution. This is acceptable for predictive purposes, but is not sufficient for knowledge discovery if multiple solutions exist. We propose a strategy to extend a class of greedy methods to efficiently identify multiple solutions, and show under which conditions it identifies all solutions. We also introduce a taxonomy of features that takes the existence of multiple solutions into account. Furthermore, we explore different definitions of statistical equiva… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 48 publications
0
8
0
Order By: Relevance
“…To fully assess clinical relevance, we next plan to validate these biosignatures in independent, blinded validation cohorts of T2DM patients, obese and prediabetic individuals from a different clinical setting (cross-dataset analysis). We should note, however, that JADBio implements internal cross-validation with the bootstrap corrected cross-validation (BBC-CV) algorithm [36,37], shown to substitute external validation. The bootstrapping technique performs a correction to the estimation of out-of-sample performance of the final model.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…To fully assess clinical relevance, we next plan to validate these biosignatures in independent, blinded validation cohorts of T2DM patients, obese and prediabetic individuals from a different clinical setting (cross-dataset analysis). We should note, however, that JADBio implements internal cross-validation with the bootstrap corrected cross-validation (BBC-CV) algorithm [36,37], shown to substitute external validation. The bootstrapping technique performs a correction to the estimation of out-of-sample performance of the final model.…”
Section: Discussionmentioning
confidence: 99%
“…In fact, JADBio performs this training and test procedure numerous times to reduce the variance of the estimation. Specifically, for small sample sizes, it employees a stratified, K-fold, repeated cross-validation protocol BBC-CV algorithm [37] that exhibits small estimation variance and removes the estimation bias due to the fact that it was selected among many. The features selected in the winning pipeline are the ones included in the returned model.…”
Section: Automl Predictive Modelling With Jadbiomentioning
confidence: 99%
“…Second, in practice, for a given ML problem, multiple equivalent solutions in variable selections can exist [ 60 ]. A shortcoming of some variable selection methods is that they injudiciously identify only a single solution, minimizing a loss function like mean squared error, classification error, etc.…”
Section: Discussionmentioning
confidence: 99%
“…The position update of the operator in the AOA algorithm is carried out through its own experience and the neighborhood experience. In this paper, the algorithm AOA-CS is based on the population evolution strategy and process, and incorporates a random component to perform location updates [22][23][24][25][26]. Each time the operator starts searching, it first updates MOA and MOP values, and then generates three random numbers r1, r2 and r3 that obey uniform distribution between 0 and 1.…”
Section: Operator Position Update Mechanismmentioning
confidence: 99%