2012
DOI: 10.46430/phen0017
|View full text |Cite
|
Sign up to set email alerts
|

Getting Started with Topic Modeling and MALLET

Abstract: In this lesson you will first learn what topic modeling is and why you might want to employ it in your research. You will then learn how to install and work with the MALLET natural language processing toolkit to do so.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0
4

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 77 publications
(37 citation statements)
references
References 0 publications
0
33
0
4
Order By: Relevance
“…As the words in a tweet are known, topics, which are latent variables, can be estimated through Gibbs sampling 29. We use the Mallet implementation of the LDA algorithm, adjusting one parameter (alpha=5) to favour fewer topics per tweet 30. All other parameters were kept at their default.…”
Section: Methodsmentioning
confidence: 99%
“…As the words in a tweet are known, topics, which are latent variables, can be estimated through Gibbs sampling 29. We use the Mallet implementation of the LDA algorithm, adjusting one parameter (alpha=5) to favour fewer topics per tweet 30. All other parameters were kept at their default.…”
Section: Methodsmentioning
confidence: 99%
“…The number of topics retrieved for tweets about each drug was varied using an optimum topic number test as suggested by a previous method [ 59 ]. We applied the LDA topic model to the documents (tweets) with a randomly specified number of topics and observed the per-document topic distributions results.…”
Section: Methodsmentioning
confidence: 99%
“…The same word may appear in multiple topics, and in some cases the topics may be more about the genre or style of the discourse than actual content‐bearing words that might more usually be viewed as a topic. Underwood () and Murakami, Thompson, Hunston, and Vajn () describe the LDA method and provide example topic lists, and Graham, Weingart, and Milligan () present a tutorial on how to implement LDA in the MALLET software. Linguists are suspicious of LDA for at least three reasons.…”
Section: Assessment Against Core Principles In Computational Linguisticsmentioning
confidence: 99%