2019
DOI: 10.1109/access.2018.2887314
|View full text |Cite
|
Sign up to set email alerts
|

Mining Dynamics of Research Topics Based on the Combined LDA and WordNet

Abstract: A large volume of research documents are available online for us to access and analysis. It is very important to detect and mine the dynamics of the research topics from these large corpora. In this paper, we propose an improved method by introducing WordNet to LDA. This approach is to find latent topics of large corpora, and then we propose many methods to analyze the dynamics of those topics. We apply the methodology to two large document collections: 1940 papers from NIPS 00-13 (1987-2000) and 2074 papers f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 22 publications
0
12
0
Order By: Relevance
“…This creates a communication barrier between business executives and modeling analysts [10]- [12]. In order to provide people of different roles with the information they need about the process [13], [14], the importance of maintaining text-based process descriptions alongside model-based ones has been recognized [15], [16].…”
Section: Process Models Have Beenmentioning
confidence: 99%
“…This creates a communication barrier between business executives and modeling analysts [10]- [12]. In order to provide people of different roles with the information they need about the process [13], [14], the importance of maintaining text-based process descriptions alongside model-based ones has been recognized [15], [16].…”
Section: Process Models Have Beenmentioning
confidence: 99%
“…In recent years, scholars in the field of microbiology and molecular biosciences have used the LDA model and its improved algorithm to identify scientific research topics and applied it to intelligence analysis based on text data (Qiu and Yu, 2018;Mehari et al, 2019). In addition to using a corpus (Li et al, 2019), topic modeling can also use additional information, such as time stamp and author network (Rosen-Zvi et al, 2010;Xu et al, 2020), as prior variables or observation variables. Other variants are by embedding new potential variables such as emotions, or by adding constraints applicable to a corpus from a specific scenario (Wang et al, 2018).…”
Section: Topic Modelingmentioning
confidence: 99%
“…A lot of papers work with the comparison between LSA and LDA but it is still not clear which is the winner [23] and it basically depends on cases and corpora. WordNet is applied to LDA to improve the performance when it allows us to differentiate word senses and inherit hypernym and hyponyms hierarchy information to broaden the co-occurrence of terms in documents [24,25]. Many recent methods based on word vectors (Word2Vec, GloVe) generally outperform traditional ones (LSA, PLSA) because they can capture the relatedness of words in topics beside word frequency distribution [11,26].…”
Section: Literature Reviewsmentioning
confidence: 99%