Proceedings of the 17th International Conference on World Wide Web 2008
DOI: 10.1145/1367497.1367510
|View full text |Cite
|
Sign up to set email alerts
|

Learning to classify short and sparse text & web with hidden topics from large-scale data collections

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
401
0
5

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 648 publications
(407 citation statements)
references
References 29 publications
1
401
0
5
Order By: Relevance
“…Unlike traditional long texts, short texts are of great difficulty in traditional text mining (such as search, subject discovery, semantic, and emotional analysis) due to their short length, low context information and statistical information. Related studies include the use of external data sources (such as Wikipedia [3], search results [4], etc.) to expand the document, or use the internal similar document information to expand the expression of short text [5].…”
Section: Challenges Posed By Network Big Datamentioning
confidence: 99%
“…Unlike traditional long texts, short texts are of great difficulty in traditional text mining (such as search, subject discovery, semantic, and emotional analysis) due to their short length, low context information and statistical information. Related studies include the use of external data sources (such as Wikipedia [3], search results [4], etc.) to expand the document, or use the internal similar document information to expand the expression of short text [5].…”
Section: Challenges Posed By Network Big Datamentioning
confidence: 99%
“…[20]. Phan et al employed a similarity estimation method of short text based on the probability distribution of document topic [21]. Zhang and Zhong collected large-scale external data to build the topic model according to the LDA, which enables word topics to enrich feature representations of short text [22].…”
Section: Related Workmentioning
confidence: 99%
“…However, it focuses meme-tracking [22]. More generally, news processing is an active related domain of research [1,15].…”
Section: Related Workmentioning
confidence: 99%