2018
DOI: 10.48550/arxiv.1809.00540
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multilingual Clustering of Streaming News

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…Recent approaches to TDT have explored both sparse and dense features. Miranda et al (2018) proposes an online clustering method that represents documents with TF-IDF features, and demonstrates high performance on a benchmark news article data set. Building on this work, Staykovski et al (2019) compares sparse TF-IDF features with dense Doc2Vec representations, showing a sizeable improvement on the standard data set according to the BCubed evaluation metric.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Recent approaches to TDT have explored both sparse and dense features. Miranda et al (2018) proposes an online clustering method that represents documents with TF-IDF features, and demonstrates high performance on a benchmark news article data set. Building on this work, Staykovski et al (2019) compares sparse TF-IDF features with dense Doc2Vec representations, showing a sizeable improvement on the standard data set according to the BCubed evaluation metric.…”
Section: Related Workmentioning
confidence: 99%
“…Some online approaches combine the time element implicitly by sorting documents in chronological order, dividing them with time slicing, and processing each slice (Allan et al 1998;Yang, Pierce, and Carbonell 1998;Dai, He, and Sun 2010;Hu et al 2017). Other work uses decay functions to extract sparse time features (Yang, Pierce, and Carbonell 1998;Brants, Chen, and Farahat 2003;Li et al 2005;He et al 2010;Ribeiro, Ferret, and Tannier 2017;Miranda et al 2018;Saravanakumar et al 2021). None of the previous work has used temporal embeddings to represent time for the TDT task.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations