2016
DOI: 10.1186/s13040-016-0114-4
|View full text |Cite
|
Sign up to set email alerts
|

Compensation of feature selection biases accompanied with improved predictive performance for binary classification by using a novel ensemble feature selection approach

Abstract: MotivationBiomarker discovery methods are essential to identify a minimal subset of features (e.g., serum markers in predictive medicine) that are relevant to develop prediction models with high accuracy. By now, there exist diverse feature selection methods, which either are embedded, combined, or independent of predictive learning algorithms. Many preceding studies showed the defectiveness of single feature selection results, which cause difficulties for professionals in a variety of fields (e.g., medical pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
50
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
1
1

Relationship

4
5

Authors

Journals

citations
Cited by 45 publications
(50 citation statements)
references
References 37 publications
0
50
0
Order By: Relevance
“…In order to find features in the output of SEDE-GPS that are predictive for lake microbial biodiversity, we used the R package EFS (Ensemble Feature Selection) and the eight alpha diversity metrics as target variable in separate analyses [13, 14]. EFS is an ensemble feature selection method that assigns weights to the features in an unbiased manner according to their predictiveness for the target value.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to find features in the output of SEDE-GPS that are predictive for lake microbial biodiversity, we used the R package EFS (Ensemble Feature Selection) and the eight alpha diversity metrics as target variable in separate analyses [13, 14]. EFS is an ensemble feature selection method that assigns weights to the features in an unbiased manner according to their predictiveness for the target value.…”
Section: Resultsmentioning
confidence: 99%
“…Next, we used the R package EFS (Ensemble Feature Selection) in order to rank the remaining features according to their importance. EFS is an ensemble learning feature selection method, that corrects for biases of the single methods when weighting features [13, 14]. Although EFS has been developed for feature selection in classification studies, we used an adapted version of EFS, which can be used for regression studies.…”
Section: Methodsmentioning
confidence: 99%
“…Alternatives to the Boruta method are discussed and evaluated in [22], whereby Boruta was identified as best performing technology among the tested ones if used for different dimensionalities of the data. In 2017 Neumann et al presented the Ensemble Feature Selection (EFS) method which combines multiple FS methods to remove individual biases and give aggregated feature relevance ranges for all of them [23], [24].…”
Section: Introductionmentioning
confidence: 99%
“…The following section will briefly introduce our methodology. For deeper insights please refer to [10]. Our EFS currently incorporates eight feature selection methods for binary classifications, namely median, Pearson- and Spearman-correlation, logistic regression, and four variable importance measures embedded in two different implementations of the random forest algorithm, namely cforest [9] and randomForest [13].…”
Section: Methodsmentioning
confidence: 99%