2020
DOI: 10.1186/s13326-020-00226-w
|View full text |Cite
|
Sign up to set email alerts
|

Methodologically grounded semantic analysis of large volume of chilean medical literature data applied to the analysis of medical research funding efficiency in Chile

Abstract: Background Medical knowledge is accumulated in scientific research papers along time. In order to exploit this knowledge by automated systems, there is a growing interest in developing text mining methodologies to extract, structure, and analyze in the shortest time possible the knowledge encoded in the large volume of medical literature. In this paper, we use the Latent Dirichlet Allocation approach to analyze the correlation between funding efforts and actually published research results in order to provide … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…In order to perform topic modeling on the translated Instagram posts, we used (LDA), the most widely used topic modeling algorithm ( Kapoor et al, 2018 ; Wolff et al, 2020 ). To estimate the optimal quantity of topics, we applied the ldatuning package ( Nikita & Chaney, 2016 ) which calculates four different metrics relative to the document-term matrix of the Instagram posts corpus in order to identify a preferable number of topics for the LDA model.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to perform topic modeling on the translated Instagram posts, we used (LDA), the most widely used topic modeling algorithm ( Kapoor et al, 2018 ; Wolff et al, 2020 ). To estimate the optimal quantity of topics, we applied the ldatuning package ( Nikita & Chaney, 2016 ) which calculates four different metrics relative to the document-term matrix of the Instagram posts corpus in order to identify a preferable number of topics for the LDA model.…”
Section: Methodsmentioning
confidence: 99%
“…In the two-dimensional space, the distance between the circles’ centers is a measure of the similarity of the topics: the most closely related topics have the shortest distance. This method allows topics to overlap each other in terms of content, rather than being separated into discrete groups, in a way that mirrors the typical use of natural language ( Wolff et al, 2020 ). This tool allows us to grasp the meaning of each topic, to determine the prevalence of each topic in the Instagram posts, and to infer the similarity link between each of the obtained topics.…”
Section: Methodsmentioning
confidence: 99%