2011
DOI: 10.1016/j.datak.2011.03.009
|View full text |Cite
|
Sign up to set email alerts
|

Extracting hot spots of topics from time-stamped documents

Abstract: Identifying time periods with a burst of activities related to a topic has been an important problem in analyzing time-stamped documents. In this paper, we propose an approach to extract a hot spot of a given topic in a time-stamped document set. Topics can be basic, containing a simple list of keywords, or complex. Logical relationships such as and, or, and not are used to build complex topics from basic topics. A concept of presence measure of a topic based on fuzzy set theory is introduced to compute the am… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0
1

Year Published

2012
2012
2024
2024

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 30 publications
0
4
0
1
Order By: Relevance
“…Finally, a third limitation of most ETD systems relates to the use of a unitary static monolithic text corpus from human-maintained indexed databases, such as INSPEC, topic detection and tracking (TDT) or COMPENDEX (Nowell et al, 1997;Lent et al, 1997;Swan and Jensen, 2000;Wong et al, 2000;Kumaran and Allan, 2004;Mei and Zhai, 2005;Zhang et al, 2007;Subasic and Berendt, 2010;Chen and Chundi, 2011). A closed static textual data corpus suffers from limited diversity, variety and richness and must be periodically refreshed, imposing major drawbacks, such as data coverage and indexer effect (Alexa, 1997;Zweigenbaum et al, 2001;Banko and Brill, 2001;Keller and Lapata, 2003).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Finally, a third limitation of most ETD systems relates to the use of a unitary static monolithic text corpus from human-maintained indexed databases, such as INSPEC, topic detection and tracking (TDT) or COMPENDEX (Nowell et al, 1997;Lent et al, 1997;Swan and Jensen, 2000;Wong et al, 2000;Kumaran and Allan, 2004;Mei and Zhai, 2005;Zhang et al, 2007;Subasic and Berendt, 2010;Chen and Chundi, 2011). A closed static textual data corpus suffers from limited diversity, variety and richness and must be periodically refreshed, imposing major drawbacks, such as data coverage and indexer effect (Alexa, 1997;Zweigenbaum et al, 2001;Banko and Brill, 2001;Keller and Lapata, 2003).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Sun et al [16] proposed a novel algorithm for BTD within multiple data streams via a network-based method. Chen and Chundi [17] found a technique to extract topic hot spots from text based on a time-stamped sequence. Takahashi et al [4] proposed a method to discover emerging topics from social streams based on an analysis of Link-Anomaly Detection.…”
Section: The Related Literaturementioning
confidence: 99%
“…Although identifying temporal information from text documents is an active area of research (Chundi and Rosenkrantz, 2006;Chen and Chundi, 2011), to the best of our knowledge, identifying temporally changing keyword relationships from document sets is still an emerging field. In and , Kage and Sumiya define several temporal relationships among keywords, such as co-occurring, ordered, and define an approach to determine these relationships.…”
Section: Related Workmentioning
confidence: 99%