2014 Science and Information Conference 2014
DOI: 10.1109/sai.2014.6918213
|View full text |Cite
|
Sign up to set email alerts
|

A survey of feature selection and feature extraction techniques in machine learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
529
0
8

Year Published

2015
2015
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 896 publications
(539 citation statements)
references
References 17 publications
2
529
0
8
Order By: Relevance
“…Feature redundancy increases search space size and affects the speed and the accuracy of learning algorithms [49]. The correlation coefficient, which indicates the strength and direction of a relationship between two random variables, is used to measure the redundancy.…”
Section: Evaluation Of Feature Redundancymentioning
confidence: 99%
“…Feature redundancy increases search space size and affects the speed and the accuracy of learning algorithms [49]. The correlation coefficient, which indicates the strength and direction of a relationship between two random variables, is used to measure the redundancy.…”
Section: Evaluation Of Feature Redundancymentioning
confidence: 99%
“…As the name implies, it is the redundant information, such as duplicate instances and derived attributes of others that contain the same information [65][66]. As mentioned above, the algorithms found in the papers to solve the redundancy issue were classified in four categories: unsupervised learning, supervised learning, statistics and others.…”
Section: Redundancymentioning
confidence: 99%
“… Redundancy: as the name implies is the redundant information such as duplicate instances and derived attributes of others that contain the same information [26], [27]. In Table 3 are shown the approaches to solve the problems related to amount of data and redundancy.…”
Section: Journal Of Computersmentioning
confidence: 99%
“…Several approaches exist to tackle the issues of data quality in outliers [22], noise [18], inconsistency [23], incompleteness [24], redundancy [26], [27], amount of data [28]- [30], heterogeneity [14], and timeliness [25]. Nevertheless the results to date not consider resolve the issues in ensemble.…”
Section: Conclusion and Future Researchmentioning
confidence: 99%