Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining 2018
DOI: 10.1145/3219819.3220094
|View full text |Cite
|
Sign up to set email alerts
|

Model-based Clustering of Short Text Streams

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
47
0
4

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 65 publications
(52 citation statements)
references
References 25 publications
1
47
0
4
Order By: Relevance
“…LDA has been widely utilized for detecting topics in microposts. Some of these approaches associate a single topic to an individual post [36,37], whereas others consider a document to be a collection of short posts that are aggregated by some criteria like the same author [38], temporal or geographical proximity [39], or content similarity that indicates some relevance (i.e., hashtag or keyword) [40]. Fig 2 shows the top ten words of some LDA topics resulting from tweets that we collected during the 2016 U.S. presidential debates (produced by Twit-terLDA [41]).…”
Section: Related Workmentioning
confidence: 99%
“…LDA has been widely utilized for detecting topics in microposts. Some of these approaches associate a single topic to an individual post [36,37], whereas others consider a document to be a collection of short posts that are aggregated by some criteria like the same author [38], temporal or geographical proximity [39], or content similarity that indicates some relevance (i.e., hashtag or keyword) [40]. Fig 2 shows the top ten words of some LDA topics resulting from tweets that we collected during the 2016 U.S. presidential debates (produced by Twit-terLDA [41]).…”
Section: Related Workmentioning
confidence: 99%
“…The authors stated that precision and recall in those events could be significantly improved when considering denser streams, possibly by using the geolocalisation of texts to better separate synchronous events, though it remains to be evaluated. Yin et al (2018) also aimed at tackling the explosive growth, sparsity and concept drift of short-texts in social media by proposing a stream clustering technique based on the Dirichlet process multinomial mixture model (named MStream). The MStream technique assigns a short-text to either an existing cluster or a new one based on the probabilities computed by the Dirichlet model.…”
Section: Word/character-based Techniquesmentioning
confidence: 99%
“…In response to the mentioned challenges, feature selection techniques should be applied and new learning approaches specifically designed for short-texts should be developed. In this regard, several classification (Yuan et al, 2012;Collins et al, 2015;Cui et al, 2016) and clustering (Yang & Ng, 2009;Ni et al, 2011;Weng & Lee, 2011;Tu & Ding, 2012;Li et al, 2012;Kim et al, 2012;Parikh & Karlapalem, 2013;Wang et al, 2016b;Jia et al, 2018;Yin et al, 2018) techniques included preprocessing steps aiming at reducing the dimensionality of the feature space. In all cases, the applied pre-processing included traditional techniques applied to long-texts.…”
Section: Curse Of Dimensionalitymentioning
confidence: 99%
See 2 more Smart Citations