2019
DOI: 10.1109/access.2019.2894366
|View full text |Cite
|
Sign up to set email alerts
|

Feature Selection and Its Use in Big Data: Challenges, Methods, and Trends

Abstract: Feature selection has been an important research area in data mining, which chooses a subset of relevant features for use in the model building. This paper aims to provide an overview of feature selection methods for big data mining. First, it discusses the current challenges and difficulties faced when mining valuable information from big data. A comprehensive review of existing feature selection methods in big data is then presented. Herein, we approach the review from two aspects: methods specific to a part… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
46
0
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 90 publications
(48 citation statements)
references
References 186 publications
(189 reference statements)
0
46
0
2
Order By: Relevance
“…PSO algorithm is inspired by a flock of birds seeking food. It treats each solution of the optimization problem as a bird that flies at a certain velocity in the search space, and its velocity is adjusted dynamically [38][39][40]. The bird is abstracted as a particle without weight and volume, and the location of the i-th particle in all the n dimensions is represented as…”
Section: Canonical Psomentioning
confidence: 99%
“…PSO algorithm is inspired by a flock of birds seeking food. It treats each solution of the optimization problem as a bird that flies at a certain velocity in the search space, and its velocity is adjusted dynamically [38][39][40]. The bird is abstracted as a particle without weight and volume, and the location of the i-th particle in all the n dimensions is represented as…”
Section: Canonical Psomentioning
confidence: 99%
“…Relief algorithms, one of the most successful preprocessing algorithms, can select attributes effectively without any assumption of attribute independence. However, they are restricted to two-class problems [45][46][47][48][49]. The Relief-F algorithm, which is extended from the relief algorithm and suitable for handling multi-class problems, is applied in this paper to further select attributes.…”
Section: Of 13mentioning
confidence: 99%
“…Hence, it is recommended to adoption of dimensionality reduction or features selection techniques [38]. Dimensionality reduction techniques combine original features to provide a smaller number of features with enhanced predictive power.…”
Section: Feature Selectionmentioning
confidence: 99%