Proceedings of the First EAI International Conference on Computer Science and Engineering 2017
DOI: 10.4108/eai.27-2-2017.152282

Unsupervised Text Feature Selection Technique Based on Particle Swarm Optimization Algorithm for Improving the Text Clustering

Abstract: As the amount of text information on web pages increases, handling it becomes very complex because of its sheer volume. Text clustering is an appropriate technique for dealing with a huge number of text documents by partitioning the set of documents into groups. Text documents contain uninformative features, which degrade the performance of text clustering. Feature selection is an unsupervised technique used to select informative features by creating a new subset of in…
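As a rough illustration of the idea in the abstract, the following is a minimal sketch of unsupervised feature selection with a binary particle swarm. It is not the authors' implementation: the function names (`tfidf`, `fitness`, `binary_pso`), the toy documents, the parameter values, and especially the objective (mean TF-IDF weight of the selected terms) are assumptions made for illustration only.

```python
import math
import random

def tfidf(docs):
    """Return (vocabulary, rows) with one TF-IDF row per tokenized document."""
    vocab = sorted({t for d in docs for t in d})
    n = len(docs)
    df = {t: sum(1 for d in docs if t in d) for t in vocab}
    rows = [[d.count(t) / len(d) * math.log(n / df[t]) for t in vocab]
            for d in docs]
    return vocab, rows

def fitness(mask, rows):
    """Assumed objective: mean TF-IDF weight over the selected terms."""
    chosen = [j for j, bit in enumerate(mask) if bit]
    if not chosen:
        return 0.0  # an empty feature subset carries no information
    total = sum(row[j] for row in rows for j in chosen)
    return total / (len(rows) * len(chosen))

def binary_pso(rows, n_particles=10, n_iter=30, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Search for a 0/1 feature mask; all parameter values are illustrative."""
    rng = random.Random(seed)
    dim = len(rows[0])
    pos = [[rng.randint(0, 1) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p, rows) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for j in range(dim):
                # Standard velocity update toward personal and global bests.
                vel[i][j] = (w * vel[i][j]
                             + c1 * rng.random() * (pbest[i][j] - pos[i][j])
                             + c2 * rng.random() * (gbest[j] - pos[i][j]))
                # Sigmoid transfer: velocity becomes the probability of a 1 bit.
                prob = 1.0 / (1.0 + math.exp(-vel[i][j]))
                pos[i][j] = 1 if rng.random() < prob else 0
            f = fitness(pos[i], rows)
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit

# Toy corpus (hypothetical); "the" occurs in every document, so its IDF is 0.
docs = [
    ["the", "swarm", "search", "swarm"],
    ["the", "cluster", "text", "cluster"],
    ["the", "text", "feature", "search"],
]
vocab, rows = tfidf(docs)
mask, score = binary_pso(rows)
selected = [term for term, bit in zip(vocab, mask) if bit]
```

The selected subset would then replace the full vocabulary before clustering. Note that this stand-in objective favors very small subsets; a practical objective would also need to keep enough terms to represent every document.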

Cited by 23 publications (4 citation statements)
References: 17 publications

“…Abualigah et al [96] proposed a new technique for solving the feature selection problem based on the HS algorithm to search for the best subset of informative features. Then, the proposed method is utilized for clustering texts and is summarized as FSHSTC, FS technique using HS algorithm for the TC technique, where it can overcome the other methods drawbacks in improving the performance of the text clustering.…”
Section: Harmony Search (HS); citation type: mentioning
Confidence: 99%
“…Text data clustering is one of the problems of information retrieval [1,2,3]. Text data clustering is used to group unordered text documents [1,4]. The purpose of cluster analysis of text data is to detect groups of semantically similar documents among a collection of given text data without predefined categories (characteristics) of grouping.…”
Section: Introduction; citation type: mentioning
Confidence: 99%
“…In the literature, there are many scientific approaches to text data clustering [3,5,6,7,8,9,10]. The k-means local search algorithm is most widely used [8].…”
Section: Introduction; citation type: mentioning
Confidence: 99%
“…Embedded methods works with linear classifiers such as SVM are embedded in the algorithm as expanded functionality. It is also able to capture dependencies at a lower computational cost than other methods [1], [38], [50], [51]. This paper is organized as follows: A general description of the feature selection problems is presented in Section 2.…”
Section: Introduction; citation type: mentioning
Confidence: 99%