2011
DOI: 10.4018/jiit.2011070105
|View full text |Cite
|
Sign up to set email alerts
|

An Ontology Based Model for Document Clustering

Abstract: Clustering is an important topic to find relevant content from a document collection and it also reduces the search space. The current clustering research emphasizes the development of a more efficient clustering method without considering the domain knowledge and user’s need. In recent years the semantics of documents have been utilized in document clustering. The discussed work focuses on the clustering model where ontology approach is applied. The major challenge is to use the background knowledge in the si… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 50 publications
0
9
0
Order By: Relevance
“…F-measure is evaluated using the Equation 3. A higher F-measure indicates better performance [21,22].…”
Section: Clustering Evaluationmentioning
confidence: 99%
“…F-measure is evaluated using the Equation 3. A higher F-measure indicates better performance [21,22].…”
Section: Clustering Evaluationmentioning
confidence: 99%
“…Well-known methods like 'tf-idf' (Salton & McGill, 1983) are only suitable in a pre-processing manner because other than the fact that it is not probabilistic, it also holds a small amount of reduction which would lead to unsatisfying scaling behavior in incremental load environments (Rahman, 2010) considering the fast growing database of Twitter streams our research is based on. Other non-probabilistic models include fuzzy based clustering of documents (Thangamani & Thangaraj, 2011) or ontology based models for document segmentation (Sridevi & Nagaveni, 2011). The 'pLSI' model by Hofmann (1999), while probabilistic in its nature, lacks a representation on a document level.…”
Section: Exploring Microblog Entries With Topic Modelsmentioning
confidence: 99%
“…Keyword based search mechanism is improved by the use of Ontologies. PSO based clustering mechanism is used to group documents based on their similarity score which improves the relevancy of documents (Sridevi & Nagaveni, 2011). According to Potok et al, Hybrid PSO and K-Means based clustering improves document relevancy.…”
Section: Related Workmentioning
confidence: 99%