2015
DOI: 10.1155/2015/246139
|View full text |Cite
|
Sign up to set email alerts
|

Evolutionary Feature Selection for Big Data Classification: A MapReduce Approach

Abstract: Nowadays, many disciplines have to deal with big datasets that additionally involve a high number of features. Feature selection methods aim at eliminating noisy, redundant, or irrelevant features that may deteriorate the classification performance. However, traditional methods lack enough scalability to cope with datasets of millions of instances and extract successful results in a delimited time. This paper presents a feature selection algorithm based on evolutionary computation that uses the MapReduce parad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
61
0
1

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 120 publications
(62 citation statements)
references
References 30 publications
0
61
0
1
Order By: Relevance
“…Each row of this matrix shows the sequence of dataset samples on the machine corresponding to that row. Each column of this matrix shows the sum of characteristics in big dataset sample classification or prediction process.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…Each row of this matrix shows the sequence of dataset samples on the machine corresponding to that row. Each column of this matrix shows the sum of characteristics in big dataset sample classification or prediction process.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…• Peralta et al [28] used the MapReduce model to implement a wrapperbased evolutionary search FS method. The dataset was split by instances and the FS method was applied to each resulting subset.…”
Section: Background and Related Workmentioning
confidence: 99%
“…For this reason, several approaches have been proposed to enable data reduction techniques to tackle big datasets based on Hadoop MapReduce. Concretely, based on k-NN, we can find an approach in [24] to perform feature selection on big datasets using the k-NN rule within a evolutionary mode. And in [25], a framework named MRPR was designed to enable instance reduction techniques to be applied on big datasets.…”
Section: B Data Reduction With the K-nn Algorithmmentioning
confidence: 99%