2020
DOI: 10.1016/j.csda.2019.106839
|View full text |Cite
|
Sign up to set email alerts
|

Benchmark for filter methods for feature selection in high-dimensional classification data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
288
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 505 publications
(290 citation statements)
references
References 45 publications
2
288
0
Order By: Relevance
“…This value is then used as a measure for the discriminative power of the respective feature. Further filter methods implemented in this paper were chosen according to the recommendations of a recent benchmark study by Bommert et al [10]. They compared 22 filter methods on 16 high-dimensional data sets with respect to predictive performance and runtime.…”
Section: Implementation Of Baseline Filter Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This value is then used as a measure for the discriminative power of the respective feature. Further filter methods implemented in this paper were chosen according to the recommendations of a recent benchmark study by Bommert et al [10]. They compared 22 filter methods on 16 high-dimensional data sets with respect to predictive performance and runtime.…”
Section: Implementation Of Baseline Filter Methodsmentioning
confidence: 99%
“…The former two methods use an entropy based feature evaluation, while the latter one assesses the node impurity of random forests. For further information on these methods see Brown et al [14], Izenmann et al [15], or Bommert et al [10]. All analyzed filter methods are used with their implementation within the R package mlr [16].…”
Section: Implementation Of Baseline Filter Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…is as defined in Eq. (5). As there are datasets with millions of features we seek an algorithm to select f t with linear complexity.…”
Section: Iterative Feature Selectionmentioning
confidence: 99%
“…Among the three main feature selection methods, filter methods are preferred to wrapper and embedded methods in applications where the computational efficiency, classifier independence, simplicity, ease of use and the stability of the results are required. Therefore, filter feature selection remains an interesting topic in many recent research areas such as biomarker identification for cancer prediction and drugs discovery, text classification and predicting defective software [3][4][5]10,11,16,18] and has growing interest in big data applications [19]; according to the Google Scholar search results, the number of research papers published related to filter methods in year 2018 is ∼1,800 of which ∼170 are in gene selection area.…”
Section: Introductionmentioning
confidence: 99%