2020
DOI: 10.1186/s12859-020-3361-9
|View full text |Cite
|
Sign up to set email alerts
|

Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms

Abstract: Background: With modern methods in biotechnology, the search for biomarkers has advanced to a challenging statistical task exploring high dimensional data sets. Feature selection is a widely researched preprocessing step to handle huge numbers of biomarker candidates and has special importance for the analysis of biomedical data. Such data sets often include many input features not related to the diagnostic or therapeutic target variable. A less researched, but also relevant aspect for medical applications are… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 23 publications
(17 citation statements)
references
References 27 publications
0
17
0
Order By: Relevance
“…To avoid this problem, we recommend using an adapted benefit-cost ratio, such as the ones proposed in Jagdhuber et al [7] or Min et al [13]. The main alternative solution to incorporate costs is a weighted linear combination as mentioned in the introduction of this paper.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…To avoid this problem, we recommend using an adapted benefit-cost ratio, such as the ones proposed in Jagdhuber et al [7] or Min et al [13]. The main alternative solution to incorporate costs is a weighted linear combination as mentioned in the introduction of this paper.…”
Section: Discussionmentioning
confidence: 99%
“…Min et al [13,14] presents cost-sensitive feature selection heuristics and also provide a thorough problem definition in the context of rough sets. Jagdhuber et al [7] and Liu et al [11] further extend this idea and propose genetic algorithms with fixed feature cost budgets. For situations without a fixed cost limit, the goal may be to harmonize costs of features and costs of prediction errors by identifying an optimal trade-off.…”
Section: Introductionmentioning
confidence: 99%
“…A recent wrapper proposal from Zhang et al [2019] used an artificial bee colony algorithm for subset exploration. When imposing a total cost limitation (a.k.a., hard-margin), Jagdhuber et al [2020] proposed a modified genetic algorithm as well as a greedy forward selection approach based on the Akaike Information Criterion. Our goal with such methods was to select a subset of summaries with low computational cost that do not alter the accuracy of the classifier compared to the version that ignores cost.…”
Section: Reference Table and Feature Selectionmentioning
confidence: 99%
“…These works have tended to provide approximate methods or heuristics for solving the cost-constrained optimization problems associated with such tasks 14 – 17 . In healthcare applications, a number of previous works have introduced budget constraints, such as the financial costs of lab tests or clinical preferences, into their proposed machine learning models 18 , 19 . However, to the best of our knowledge, our work is the first to consider cost-sensitive learning applied to COVID-19 .…”
Section: Introductionmentioning
confidence: 99%