2020
DOI: 10.1007/978-3-030-44584-3_14
|View full text |Cite
|
Sign up to set email alerts
|

A Consensus Approach to Improve NMF Document Clustering

Abstract: Nonnegative Matrix Factorization (NMF) which was originally designed for dimensionality reduction has received throughout the years a tremendous amount of attention for clustering purposes in several fields such as image processing or text mining. However, despite its mathematical elegance and simplicity, NMF has exposed a main issue which is its strong sensitivity to starting points, resulting in NMF struggling to converge toward an optimal solution. On another hand, we came to explore and discovered that eve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…However, the optimizing strategy of ( 6) still faces the challenges of high complexity. Specifically, once the matrix Z is learned, calculating the Laplacian matrix requires O(n 2 ) computational complexity during optimizing V. Compared with standard NMF with O(n) complexity, this will limit the scalability of model (6). Importantly, the graph Laplacian regularizer is a Euclidean-based distance metric, which can be reformulated as…”
Section: Motivation and Objective Functionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the optimizing strategy of ( 6) still faces the challenges of high complexity. Specifically, once the matrix Z is learned, calculating the Laplacian matrix requires O(n 2 ) computational complexity during optimizing V. Compared with standard NMF with O(n) complexity, this will limit the scalability of model (6). Importantly, the graph Laplacian regularizer is a Euclidean-based distance metric, which can be reformulated as…”
Section: Motivation and Objective Functionmentioning
confidence: 99%
“…This non-negativity constraints result in the parts-based representation of NMF. In practice, NMF has been applied to many applications, such as document clustering [6], image segmentation [7], text mining [8], image clustering [9,10] and biological data mining [11].…”
Section: Introductionmentioning
confidence: 99%
“…IR has presented several techniques for the processing and retrieval of relevant documents [18]. As in the last few years, we have observed document clustering as a proliferation solution in applying IR based on the assumption that if a document is pertinent to a query, other documents of that cluster can pertain to that query too [19]. It is believed that data storage indexing can be improved via grouping XML documents together which will have a positive effect on the retrieval process [20].…”
Section: Introductionmentioning
confidence: 99%