2013
DOI: 10.1002/1944-2866.poi331
|View full text |Cite
|
Sign up to set email alerts
|

Mapping the public agenda with topic modeling: The case of the Russian livejournal

Abstract: This article describes agendas as “packages” of topics of varying salience, set by the Russian Internet users on Russia's leading blog platform LiveJournal. The research involved modeling LiveJournal's topic structure, viewed as an important component of what is termed here self‐generated public opinion. Topic modeling was performed automatically with the LDA algorithm, and complemented with hand labeling of topics. Data were collected by software created by the authors to generate a relational database storin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
47
0
3

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 62 publications
(52 citation statements)
references
References 37 publications
2
47
0
3
Order By: Relevance
“…The sample of popular bloggers has comprised all posts written by top 2000 bloggers, the threshold being taken from the previous research [8]. "Ordinary" bloggers were sampled randomly from the range 2,001-150,000 so that the total number of posts be equal to that of the top bloggers, which resulted in the sample of 20,000 "ordinary" bloggers.…”
Section: Data Sample and Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The sample of popular bloggers has comprised all posts written by top 2000 bloggers, the threshold being taken from the previous research [8]. "Ordinary" bloggers were sampled randomly from the range 2,001-150,000 so that the total number of posts be equal to that of the top bloggers, which resulted in the sample of 20,000 "ordinary" bloggers.…”
Section: Data Sample and Methodsmentioning
confidence: 99%
“…A Latent Dirichlet Allocation (LDA) algorithm with Gibbs sampling [3] was run over the joint collection of pre-processed posts yielding the probabilities of each topic in each text, as well as the probabilities of each unique word in each topic (topic-document matrix and word-topic matrix, respectively). The number of topics was set = 120 based on earlier experiments [8]. The probabilities of each topic in texts were then summed up separately for each group of bloggers ("top" vs "long tail"), which gave us the "weights" of each topic in the two groups.…”
Section: Data Sample and Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…В последнем случае используются дополнительные лингвистиче-ские инструменты, в том числе специальные словари. Особой задачей является те-матическое моделирование, которое нашло свое приложение в изучении повестки дня и онлайн-общественного мнения в блогосфере (Koltsova, Koltcov, 2013).…”
Section: критические исследования интернета и новых медиаunclassified
“…Additionally, the authors predict which users are likely to comment on which blogs. Finally, another recent example applies the same method to the analysis of issue salience in the Russian blogosphere (Kolstova and Koltcov, 2013). The authors use the method to identify a shift in topics during the political protests that took place during the parliamentary and presidential elections in late 2011 and early 2012.…”
Section: Unsupervised Learning Applicationsmentioning
confidence: 99%