2024
DOI: 10.21203/rs.3.rs-4672501/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Text topic modeling via representation learning non-negative matrix factorization with semantic similarity

Yang Xu,
Yueyi Zhang,
Jing Hu

Abstract: Topic models are instrumental in text mining, revealing discriminative and coherent latent topics. Fewer words in short texts lead to insufficient contextual information and produce a highly sparse document-word matrix. So traditional topic models struggle to effectively cluster short texts. Models incorporating global word co-occurrence introduce too much information when processing long texts, resulting in a decrease in convergence speed and poorer clustering accuracy. To overcome sparsity in short texts and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 23 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?