2022
DOI: 10.1587/transinf.2021edp7144
|View full text |Cite
|
Sign up to set email alerts
|

A Method of K-Means Clustering Based on TF-IDF for Software Requirements Documents Written in Chinese Language

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…The experimental analysis above shows that the extraction of TF-IDF variables results in a multi-marketplace product categorization that is close to the optimal value of the cluster if the grouping formed is four clusters. This is to research conducted [22] that variable extraction is very influential in clustering models, and TF-IDF is a variable extraction method that can provide significant results [42]. Meanwhile, the application of PCA also has a significant influence on cluster results, namely on TF-IDF with a PCA value of 70, and the use of Python can optimize the data so that the results provided can be better.…”
Section: Discussionmentioning
confidence: 99%
“…The experimental analysis above shows that the extraction of TF-IDF variables results in a multi-marketplace product categorization that is close to the optimal value of the cluster if the grouping formed is four clusters. This is to research conducted [22] that variable extraction is very influential in clustering models, and TF-IDF is a variable extraction method that can provide significant results [42]. Meanwhile, the application of PCA also has a significant influence on cluster results, namely on TF-IDF with a PCA value of 70, and the use of Python can optimize the data so that the results provided can be better.…”
Section: Discussionmentioning
confidence: 99%
“…TF-IDF is one of the traditional methods based on statistics [21]. It has been used in many different applications, such as document clustering [22], text classification [23], detection of domain name generation algorithms [24], and comparing research trends [25]. Term frequency or word frequency is a rarer method used in information retrieval systems compared to TF-IDF.…”
Section: Related Workmentioning
confidence: 99%