2020 5th International Conference on Communication and Electronics Systems (ICCES) 2020
DOI: 10.1109/icces48766.2020.9137928
|View full text |Cite
|
Sign up to set email alerts
|

Text Document Clustering Using K-means Algorithm with Dimension Reduction Techniques

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…This work uses TF-IDF, SVM, NMF, and k-means clustering techniques to categorize data. Results are compared using 20 newsgroup datasets, demonstrating the effectiveness of these techniques [9]. Document clustering is a classification process that operates without the need for supervision, wherein documents are grouped together into separate groups based on their similarities and differences.…”
Section: Literature Reviewmentioning
confidence: 99%
“…This work uses TF-IDF, SVM, NMF, and k-means clustering techniques to categorize data. Results are compared using 20 newsgroup datasets, demonstrating the effectiveness of these techniques [9]. Document clustering is a classification process that operates without the need for supervision, wherein documents are grouped together into separate groups based on their similarities and differences.…”
Section: Literature Reviewmentioning
confidence: 99%
“…In the previous works presented for opinion texts clustering using dimension reduction, mostly linear dimension reduction methods including principal component analysis (PCA), singular value decomposition (SVD), nonnegative matrix factorization (NMF), and linear discriminant analysis (LDA) have been used, and also in some cases, feature selection method has been applied. In [25][26][27][28][29][30], first, dimensions of the opinion texts were reduced by linear dimension reduction methods, and then, clustering was performed. In [25], the TF-IDF model was used to represent the text for clustering; then, dimensions were reduced using the SVD and NMF methods.…”
Section: Opinion Texts Clustering By Dimension Reductionmentioning
confidence: 99%
“…In [25][26][27][28][29][30], first, dimensions of the opinion texts were reduced by linear dimension reduction methods, and then, clustering was performed. In [25], the TF-IDF model was used to represent the text for clustering; then, dimensions were reduced using the SVD and NMF methods. Finally, clustering was performed on the given dimension-reduced texts by the K-Means algorithm.…”
Section: Opinion Texts Clustering By Dimension Reductionmentioning
confidence: 99%
“…This demonstrates that the proposed strategy improves the grouping of English texts in text documents. Large data dimensions are a barrier to extracting relevant information, according to RutujaKumbhar and SnehalMhamane [4], hence the dimensions of data matrices are reduced using the dimensionality reduction (DR) technique. TF-IDF, SVD, kmeans clustering and NMF are used to further partition data into clusters using the k-means clustering algorithm.…”
Section: Related Workmentioning
confidence: 99%