2016 IEEE Tenth International Conference on Semantic Computing (ICSC) 2016
DOI: 10.1109/icsc.2016.8
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Document Clustering Using a Similarity Graph

Abstract: Document clustering addresses the problem of identifying groups of similar documents without human supervision. Unlike most existing solutions that perform document clustering based on keywords matching, we propose an algorithm that considers the meaning of the terms in the documents. For example, a document that contains the words "dog" and "cat" multiple times may be placed in the same category as a document that contains the word "pet" even if the two documents share only noise words in common. Our semantic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
11
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…As [40], the approach for semantic document clustering based on the graph similarity. Graph is mainly used information from WordNet to the degree of semantic similarity between 150,000 of the most common in English terminology.…”
Section: Literature Reviewmentioning
confidence: 99%
“…As [40], the approach for semantic document clustering based on the graph similarity. Graph is mainly used information from WordNet to the degree of semantic similarity between 150,000 of the most common in English terminology.…”
Section: Literature Reviewmentioning
confidence: 99%
“…A preliminary version of this article was published in the conference proceedings of the Tenth IEEE International Conference on Semantic Computing [40]. Here, the paper is signi¯cantly revised, corrections are made, and more detailed explanations are provided in every section.…”
Section: Related Researchmentioning
confidence: 99%
“…Our approach di®ers because we use supervised learning only for ¯ne-tuning the algorithm. For example, our original algorithm in [40] is unsupervised, it does not use a training set, and it can cluster documents in any number of classes rather than just classify the documents in preexisting categories.…”
Section: Related Researchmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors of [3] for example try to cluster documents from MEDLINE by using evolutionary algorithms, whereas [4] use machine learning approaches. Only few authors like [5] use graph-based approaches. Some authors, like [6] cover related problems like clustering in the context of search queries, whereas [7] work on the field of hierarchical clusterings.…”
mentioning
confidence: 99%