2021
DOI: 10.1108/dta-11-2020-0262
|View full text |Cite
|
Sign up to set email alerts
|

A scalable eigenspace-based fuzzy c-means for topic detection

Abstract: PurposeThe aim of this research is to develop an eigenspace-based fuzzy c-means method for scalable topic detection.Design/methodology/approachThe eigenspace-based fuzzy c-means (EFCM) combines representation learning and clustering. The textual data are transformed into a lower-dimensional eigenspace using truncated singular value decomposition. Fuzzy c-means is performed on the eigenspace to identify the centroids of each cluster. The topics are provided by transforming back the centroids into the nonnegativ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…Topic detection methods are algorithms for detecting topics or themes in an unstructured collection of documents. This research explore the use of Bidirectional Encoder Representation from Transformer (BERT) [17] in Eigenspace-based Fuzzy C-Means (EFCM) [10][11][12]. Secondly, we use c-TFIDF [20] as topic representation instead of standard centroid method.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Topic detection methods are algorithms for detecting topics or themes in an unstructured collection of documents. This research explore the use of Bidirectional Encoder Representation from Transformer (BERT) [17] in Eigenspace-based Fuzzy C-Means (EFCM) [10][11][12]. Secondly, we use c-TFIDF [20] as topic representation instead of standard centroid method.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, the clustering method is more flexible to be combined with many kinds of representation learning. One of the clustering methods for topic detection is Eigenspace-based Fuzzy C-Means (EFCM) [10][11][12]. EFCM is a clustering-based Topic Detection method consisting of two main parts, i.e., Truncated Singular Value Decomposition (TSVD) for dimension reduction and Fuzzy C-Means Clustering (FCM) for clustering.…”
Section: Introductionmentioning
confidence: 99%
“…Autoencoder based fuzzy c-means algorithm is presented by [37]. Autoencoder is used for representation of tweets while fuzzy c-means is the clustering part of method.…”
Section: Deep Learning-based Methodsmentioning
confidence: 99%