2022
DOI: 10.1007/978-3-031-13643-6_2
|View full text |Cite
|
Sign up to set email alerts
|

Tracking News Stories in Short Messages in the Era of Infodemic

Abstract: Tracking news stories in documents is a way to deal with the large amount of information that surrounds us everyday, to reduce the noise and to detect emergent topics in news. Since the Covid-19 outbreak, the world has known a new problem: infodemic. News article titles are massively shared on social networks and the analysis of trends and growing topics is complex. Grouping documents in news stories lowers the number of topics to analyse and the information to ingest and/or evaluate. Our study proposes to ana… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 21 publications
(46 reference statements)
0
1
0
Order By: Relevance
“…We compare the semantic content of our embedded messages and paragraphs utilizing cosine similarity (Song et al 2020). As found in previous works (Hanley, Kumar, and Durumeric 2023b;Song et al 2020;Bernard et al 2022), a cosine similarity threshold between 0.60 and 0.80 can be utilized to determine whether two pieces of text are about the same topic. For instance, with the same model, Phan et al (2022), found that a threshold near 0.715 achieved the best results.…”
Section: Comparing Semantic Contentmentioning
confidence: 99%
“…We compare the semantic content of our embedded messages and paragraphs utilizing cosine similarity (Song et al 2020). As found in previous works (Hanley, Kumar, and Durumeric 2023b;Song et al 2020;Bernard et al 2022), a cosine similarity threshold between 0.60 and 0.80 can be utilized to determine whether two pieces of text are about the same topic. For instance, with the same model, Phan et al (2022), found that a threshold near 0.715 achieved the best results.…”
Section: Comparing Semantic Contentmentioning
confidence: 99%