2011
DOI: 10.1007/s10844-011-0172-5
|View full text |Cite
|
Sign up to set email alerts
|

A new unsupervised feature selection method for text clustering based on genetic algorithms

Abstract: Nowadays a vast amount of textual information is collected and stored in various databases around the world, including the Internet as the largest database of all. This rapidly increasing growth of published text means that even the most avid reader cannot hope to keep up with all the reading in a field and consequently the nuggets of insight or new knowledge are at risk of languishing undiscovered in the literature. Text mining offers a solution to this problem by replacing or supplementing the human reader w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(10 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…The reason is more features will attenuate the classification performance [51]. Reducing the number of features can simplify the model, cost shorter training time, and augment generalization performance through reduction of variance [52].…”
Section: Optimal Decomposition Level Selectionmentioning
confidence: 99%
“…The reason is more features will attenuate the classification performance [51]. Reducing the number of features can simplify the model, cost shorter training time, and augment generalization performance through reduction of variance [52].…”
Section: Optimal Decomposition Level Selectionmentioning
confidence: 99%
“…Nevertheless, other unsupervised feature selection methods had been used with text clustering such as Document Frequency (DF), Term Contribution (TC) or Term Variance (TV) among other statistical techniques (Luying et al, 2005;Tang et al, 2005). Moreover, there are recently evolutionary algorithm based optimization methods for term or keyword selection, such as for instance the technique in (Shamsinejadbabki and Saraee, 2012). …”
Section: Dimensional Reductionmentioning
confidence: 99%
“…All the mentioned methods work in three major steps: (1) Define a formula for measuring the discriminative power of a term. (2) Sort the terms based on the value of defined measurement and (3) Choose a number of the terms from top of the list (Shamsinejadbabki and Saraee 2011).…”
Section: Text Miningmentioning
confidence: 99%