2020
DOI: 10.1088/1755-1315/428/1/012024
|View full text |Cite
|
Sign up to set email alerts
|

The Study of an Improved Text Clustering Algorithm for Self-Organizing Maps

Abstract: The traditional SOM algorithm need to determine the number of clustering categories in advance, which is very subjective. In this paper, an improved k-means initial value selection algorithm is proposed to calculate the number of clustering categories, which is applied to SOM network model. In this algorithm, the Latent Semantic Indexing is applied in the pre-processing stage of clustering, and the improved SOM algorithm is applied in the text clustering stage. Namely, the number of clustering categories obtai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 1 publication
0
1
0
Order By: Relevance
“…Data clustering can be achieved through the topology representation of the high-dimensional data, that is, the data vectors with similar distribution characters are grouped together and displayed graphically thanks to the visualization advantage of SOM [4]. Thus, SOM naturally has many applications such as image and text segmentation [5], [13], medical diagnosis [6], [12], [14], target recognition [7], and traffic attack detection [8], [9], [20]. However, previous studies have theoretically predicted that the original simple model of SOM fails to fully represent the topology of the asymmetric-distributed (network traffic) data [4], [10].…”
Section: Introductionmentioning
confidence: 99%
“…Data clustering can be achieved through the topology representation of the high-dimensional data, that is, the data vectors with similar distribution characters are grouped together and displayed graphically thanks to the visualization advantage of SOM [4]. Thus, SOM naturally has many applications such as image and text segmentation [5], [13], medical diagnosis [6], [12], [14], target recognition [7], and traffic attack detection [8], [9], [20]. However, previous studies have theoretically predicted that the original simple model of SOM fails to fully represent the topology of the asymmetric-distributed (network traffic) data [4], [10].…”
Section: Introductionmentioning
confidence: 99%