2021
DOI: 10.1109/tlt.2021.3064798
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing Large Collections of Open-Ended Feedback From MOOC Learners Using LDA Topic Modeling and Qualitative Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 39 publications
(14 citation statements)
references
References 26 publications
1
13
0
Order By: Relevance
“…The Overall Topical Patterns Repeated modeling with topic numbers ranging from 3 to 26 on the corpus suggests that the optimized topic numbers in the US and Chinese reports are both 20, with the "Cv" coherence scores being 0.54 and 0.51, respectively. According to findings in some previous research (e.g., Nanda et al, 2021;Röder et al, 2015;Syed & Spruit, 2017), a "Cv" coherence score equals 0.50 or above is more likely to efficiently reflect the topical structure of the corpus under investigation. Based on their attached keywords, each topic is given a label that could properly summarize its topical theme.…”
Section: Resultsmentioning
confidence: 87%
“…The Overall Topical Patterns Repeated modeling with topic numbers ranging from 3 to 26 on the corpus suggests that the optimized topic numbers in the US and Chinese reports are both 20, with the "Cv" coherence scores being 0.54 and 0.51, respectively. According to findings in some previous research (e.g., Nanda et al, 2021;Röder et al, 2015;Syed & Spruit, 2017), a "Cv" coherence score equals 0.50 or above is more likely to efficiently reflect the topical structure of the corpus under investigation. Based on their attached keywords, each topic is given a label that could properly summarize its topical theme.…”
Section: Resultsmentioning
confidence: 87%
“…LDA is an unsupervised machine learning technique that utilizes a probabilistic graph model 23 . It determines and extracts words based on the contextual probability of words within documents and possesses certain empirical characteristics.…”
Section: Methodsmentioning
confidence: 99%
“…To provide answers to RQ2 which addresses education blockchain infrastructure, technologies and methods, we conducted a content analysis on all the 468 records. First, we excluded 5 entries with missing abstracts and performed topic modelling with the abstract data using LDA as used in some previous studies [40], [41]. To obtain the optimal number of topics to classify the abstract data, we calculated the topic coherence score [42] for various models while changing the number of topics: starting at 2 topics to 50 topics in steps of 6.…”
Section: Content Analysis For Case Studymentioning
confidence: 99%