2019
DOI: 10.1111/cgf.13714
|View full text |Cite
|
Sign up to set email alerts
|

Topic Tomographies (TopTom): a visual approach to distill information from media streams

Abstract: In this paper we present Top Tom, a digital platform whose goal is to provide analytical and visual solutions for the exploration of a dynamic corpus of user‐generated messages and media articles, with the aim of i) distilling the information from thousands of documents in a low‐dimensional space of explainable topics, ii) cluster them in a hierarchical fashion while allowing to drill down to details and stories as constituents of the topics, iii) spotting trends and anomalies. Top Tom implements a batch proce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…Although there are many approaches to temporal and hierarchical topic modeling [104][105][106], we choose to apply NMF to the dataset, and then build time-varying intensities for each topic using the articles publication date. Starting from a dataset D containing the news articles shared in Reddit, we extract words and phrases with the methodology described in [107], discarding terms with frequency below 10, to form a vocabulary V with around 60k terms.…”
Section: E Topic Modelingmentioning
confidence: 99%
“…Although there are many approaches to temporal and hierarchical topic modeling [104][105][106], we choose to apply NMF to the dataset, and then build time-varying intensities for each topic using the articles publication date. Starting from a dataset D containing the news articles shared in Reddit, we extract words and phrases with the methodology described in [107], discarding terms with frequency below 10, to form a vocabulary V with around 60k terms.…”
Section: E Topic Modelingmentioning
confidence: 99%
“…Nonnegative matrix factorization (NMF) [67] is the counterpart of LDA for matrix factorization. Although there are many approaches to temporal and hierarchical topic modeling [68][69][70], we chose to apply NMF to the data set and then build time-varying intensities for each topic using the publication dates of the articles. Starting from a data set D containing the news articles shared in Reddit, we extracted words and phrases with the methodology described in [71], discarding terms with frequencies >10, to form a vocabulary V with approximately 60,000 terms.…”
Section: Topic Modelingmentioning
confidence: 99%