Proceedings of the CUBE International Information Technology Conference 2012
DOI: 10.1145/2381716.2381775
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised topic detection model and its application in text categorization

Abstract: In most of the research, topic detection is defined as the task of finding out different themes from the collection of documents. Our topic detection approach is about finding a topic for every document in the corpus. Any word or group of words which tells what the document is about is defined as the topic of the document. In this paper, we propose a novel topic detection approach using an unsupervised model. It is a simple yet effective approach for topic detection and finding keywords from the corpus.The key… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…To date, many scholars have studied this field. Observing the previous related research, the techniques applied in topic detection can be roughly divided into two categories: nonprobability models and probability models [12,34]. A nonprobability model detects the topic by methods such as document clustering and graph analysis.…”
Section: Topic Detectionmentioning
confidence: 99%
“…To date, many scholars have studied this field. Observing the previous related research, the techniques applied in topic detection can be roughly divided into two categories: nonprobability models and probability models [12,34]. A nonprobability model detects the topic by methods such as document clustering and graph analysis.…”
Section: Topic Detectionmentioning
confidence: 99%
“…In [3] after preprocessing the document that is tokenization, normalization and removal of stop words, top 20 highest …”
Section: Related Workmentioning
confidence: 99%
“…In addition, various measures are applied such as term frequency analysis, citation analysis and co-occurrence of keywords or authors. Haribhakta et al, [11] represent documents with keywords and compare the co-occurrence of keywords to detect hot topic. Bu-Yeo Kim et al, [12] compare the co-occurrence of keywords to map the dementia research area at the micro-level.…”
Section: Introductionmentioning
confidence: 99%