2020
DOI: 10.3233/sw-200373
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale semantic exploration of scientific literature using topic-based hashing algorithms

Abstract: Searching for similar documents and exploring major themes covered across groups of documents are common activities when browsing collections of scientific papers. This manual knowledge-intensive task can become less tedious and even lead to unexpected relevant findings if unsupervised algorithms are applied to help researchers. Most text mining algorithms represent documents in a common feature space that abstract them away from the specific sequence of words used in them. Probabilistic Topic Models reduce th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 37 publications
0
5
0
Order By: Relevance
“…Most of these problems are reflected in older kindergarten teachers and township kindergartens with relatively backward economic levels. Some kindergarten teachers lack the understanding of multimedia teaching and do not know the important role of multimedia teaching in early childhood teaching, which affects the application effect of multimedia teaching in preschool education [ 10 12 ]. Thematic search engine appears in such a background.…”
Section: Introductionmentioning
confidence: 99%
“…Most of these problems are reflected in older kindergarten teachers and township kindergartens with relatively backward economic levels. Some kindergarten teachers lack the understanding of multimedia teaching and do not know the important role of multimedia teaching in early childhood teaching, which affects the application effect of multimedia teaching in preschool education [ 10 12 ]. Thematic search engine appears in such a background.…”
Section: Introductionmentioning
confidence: 99%
“…The notion of topics is discarded and therefore the ability to make thematic explorations of documents. Recently, a hashing algorithm that groups similar documents and preserves the notion of topics has been proposed [2]. It defines a hierarchical set-type data where each level of the hierarchy indicates the importance of the topic according to its distribution.…”
Section: Related Workmentioning
confidence: 99%
“…Documents represented as a weighted mixture of latent topics (per-document topic distributions) are then annotated in these feature spaces with the relation between topics inside each hierarchy level. Regardless of their language, they are then described by cross-lingual concepts (based on WordNet-synset annotations) and hash codes are calculated to summarize their content [2]. The hash expression sets a 3-level hierarchy of cross-lingual concepts.…”
Section: Document Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…After a careful reviewing process, two papers were accepted, which focus on discoverability and findability of scientific products. In [2] the authors ease exploration of scientific literature by grouping similar papers according to their topic; while in [3] the authors assist other researchers in describing and summarizing their scientific experiments based on publicly available metadata.…”
Section: Special Issue Overviewmentioning
confidence: 99%