2014
DOI: 10.1007/978-81-322-1602-5_154
|View full text |Cite
|
Sign up to set email alerts
|

A Survey on Filter Techniques for Feature Selection in Text Mining

Abstract: A large portion of a document is usually covered by irrelevant features. Instead of identifying actual context of the document, such features increase dimensions in the representation model and computational complexity of underlying algorithm, and hence adversely affect the performance. It necessitates a requirement of relevant feature selection in the given feature space. In this context, feature selection plays a key role in removing irrelevant features from the original feature space. Feature selection meth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 26 publications
0
7
0
Order By: Relevance
“…Feature selection is one of the important tasks in text classification due to the high dimensionality of feature space and the existence of indiscriminative features [1]. Feature selection methods are able to reduce the high-dimensional indiscriminative feature space into low-dimensional discriminative feature subspace [2]. For example, in (2), feature "beautiful" and feature "morning" may be discriminative features and then a six-dimensional DTM can be reduced into a two-dimensional one.…”
Section: Dtm Of Documentsmentioning
confidence: 99%
See 3 more Smart Citations
“…Feature selection is one of the important tasks in text classification due to the high dimensionality of feature space and the existence of indiscriminative features [1]. Feature selection methods are able to reduce the high-dimensional indiscriminative feature space into low-dimensional discriminative feature subspace [2]. For example, in (2), feature "beautiful" and feature "morning" may be discriminative features and then a six-dimensional DTM can be reduced into a two-dimensional one.…”
Section: Dtm Of Documentsmentioning
confidence: 99%
“…This paper focuses on filter feature selection and all the methods introduced in this paper are filter feature selection methods. Filter methods are chosen because they are more computationally efficient than wrapper methods [2]. A lot of feature selection methods have been proposed.…”
Section: Dtm Of Documentsmentioning
confidence: 99%
See 2 more Smart Citations
“…Embedded methods are some feature selection methods included in machine learning algorithms, such as feature selection method based on information gain used in decision tree classification algorithm. Feature selection has achieved good results in many fields, such as gene array analysis [11], intrusion detection [12], and text mining [13].…”
Section: Introductionmentioning
confidence: 99%