Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2007
DOI: 10.1145/1277741.1277760
|View full text |Cite
|
Sign up to set email alerts
|

Regularized clustering for documents

Abstract: In recent years, document clustering has been receiving more and more attentions as an important and fundamental technique for unsupervised document organization, automatic topic extraction, and fast information retrieval or filtering. In this paper, we propose a novel method for clustering documents using regularization. Unlike traditional globally regularized clustering methods, our method first construct a local regularized linear label predictor for each document vector, and then combine all those local re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
30
0

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(30 citation statements)
references
References 26 publications
0
30
0
Order By: Relevance
“…In fact, by combining multiple kernels and exploiting the complementary information revealed from the different kernels, the LLC-mkl indeed improves the robustness and accuracy of the LLC. Moreover, the text data are believed to lie on a low-dimensional manifold because it is impossible for them to fill in the entire high-dimensional space [4]. We can also observe that the performance of LLC-mkl, which utilizes the manifold information, is consistently superior to that of NMF-mkl, which does not utilize such information.…”
mentioning
confidence: 85%
“…In fact, by combining multiple kernels and exploiting the complementary information revealed from the different kernels, the LLC-mkl indeed improves the robustness and accuracy of the LLC. Moreover, the text data are believed to lie on a low-dimensional manifold because it is impossible for them to fill in the entire high-dimensional space [4]. We can also observe that the performance of LLC-mkl, which utilizes the manifold information, is consistently superior to that of NMF-mkl, which does not utilize such information.…”
mentioning
confidence: 85%
“…Matrix-factorization provides a natural way of achieving this goal. It has also been shown both theoretically and experimentally [33,93] that the matrix-factorization technique is equivalent to another graph-structure based document clustering technique known as spectral clustering. An analogous technique called concept factorization was proposed in [98], which can also be applied to data points with negative values in them.…”
Section: Non-negative Matrix Factorizationmentioning
confidence: 99%
“…However, this method cannot be kernelized because the NMF must be performed in the original feature space of the data points. Wang et al [9] used clustering with local and global regularization (CLGR), which uses local label predictors and global label smoothing regularizers. They achieved satisfactory results because the CLGR algorithm uses fixed neighborhood sizes.…”
Section: Related Workmentioning
confidence: 99%
“…They achieved satisfactory results because the CLGR algorithm uses fixed neighborhood sizes. However, the different neighborhood sizes cause the final clustering results to deteriorate [9].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation