2011
DOI: 10.1016/j.ins.2011.01.029

Enhanced clustering of biomedical documents using ensemble non-negative matrix factorization

Cited by 41 publications (20 citation statements)
References 33 publications
“…Thus, document clustering techniques, being an efficient way of navigating and summarizing documents, have been intensively investigated in biomedical research. As a dimension reduction method, non-negative matrix factorization [11] has been widely applied to medical document clustering [12,13]. By imposing nonnegativity constraints in both basis and weight factorization matrices, NMF guarantees to preserve the local structure of the original data.…”
Section: Related Work (mentioning)
confidence: 99%
“…Many extensions of the basic NMF method have also been explored for clustering biomedical documents. For instance, in [13], Multi-view NMF, which can integrate different data sources, was applied for clustering clinical document, based on medication/symptom names, whereas, in [12], ensemble NMF, able to achieve a consensus solution from a set of runs with different initial conditions, was tested on the TREC genomic 2004 track. Finally, also more complex techniques were recently introduced in order to cope with graph representations of medical documents [14].…”
Section: Related Work (mentioning)
confidence: 99%
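The consensus idea in the statement above (several NMF runs from different initial conditions combined into one solution) can be sketched as follows. This is a minimal illustration and not the method of the cited paper: it assumes multiplicative-update NMF, a co-association (consensus) matrix, and a simple threshold-based grouping. The toy matrix `V` and the names `nmf`, `consensus_matrix`, and `consensus_labels` are hypothetical.

```python
import numpy as np

def nmf(V, k, seed, n_iter=300, eps=1e-9):
    """Basic NMF (V ~ W @ H) via multiplicative updates for the Frobenius loss."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps  # document-by-topic weights
    H = rng.random((k, m)) + eps  # topic-by-term basis
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def consensus_matrix(V, k, seeds):
    """Fraction of runs in which each pair of documents lands in the same cluster."""
    n = V.shape[0]
    C = np.zeros((n, n))
    for seed in seeds:
        W, _ = nmf(V, k, seed)
        labels = W.argmax(axis=1)  # assign each document to its strongest topic
        C += labels[:, None] == labels[None, :]
    return C / len(seeds)

def consensus_labels(C, threshold=0.5):
    """Group documents that co-cluster in more than `threshold` of the runs
    (connected components of the thresholded consensus matrix)."""
    n = C.shape[0]
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and C[i, j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Toy document-term counts (rows = documents, columns = terms); made up for illustration.
V = np.array([
    [3, 2, 1, 0, 0, 0],
    [2, 3, 1, 0, 0, 0],
    [1, 2, 3, 0, 0, 0],
    [0, 0, 0, 3, 1, 2],
    [0, 0, 0, 2, 3, 1],
    [0, 0, 0, 1, 2, 3],
], dtype=float)

C = consensus_matrix(V, k=2, seeds=range(10))
labels = consensus_labels(C)
print(labels)
```

Thresholding the consensus matrix is only one way to extract a final partition; hierarchical clustering of the same matrix is a common alternative.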
“…In general, textual data clustering has been affected by the 'vector space model' [62] where each document is considered a "bag of words" and represented by a weighted vector to facilitate the similarity computation [63].…”
Section: Medline Clustering (mentioning)
confidence: 99%
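The bag-of-words representation described in the statement above can be illustrated with a short sketch. It assumes whitespace tokenization, a smoothed TF-IDF weighting, and cosine similarity; these are common choices rather than ones named in the quoted text, and the sample documents and function names are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Represent each document as a sparse {term: weight} vector (bag of words)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        })
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * v[term] for term, weight in u.items() if term in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Made-up sample documents for illustration only.
docs = [
    "gene expression cancer",
    "cancer gene mutation",
    "heart disease risk",
]
v0, v1, v2 = tfidf_vectors(docs)
print(cosine(v0, v1), cosine(v0, v2))
```

Word order is discarded here, which is exactly what makes the model a "bag of words": two documents are similar only to the extent that they share weighted terms.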
“…MEDLINE (See Note 2) is the largest biomedical literature database in the world, which contains more than 24 million citations. MeSH terms are used to index almost all MEDLINE citations [1], which is crucial in biomedical text mining and information retrieval [2][3][4][5][6][7][8]. The NLM annotators who are responsible for annotating the MeSHs need to review the full text of a citation, which costs lots of time and money.…”
Section: Introduction (mentioning)
confidence: 99%