2016
DOI: 10.1007/978-3-319-46547-0_33
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Classification of Springer Nature Proceedings with Smart Topic Miner

Abstract: Abstract. The process of classifying scholarly outputs is crucial to ensure timely access to knowledge. However, this process is typically carried out manually by expert editors, leading to high costs and slow throughput. In this paper we present Smart Topic Miner (STM), a novel solution which uses semantic web technologies to classify scholarly publications on the basis of a very large automatically generated ontology of research areas. STM was developed to support the Springer Nature Computer Science editori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
65
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
5
2

Relationship

6
1

Authors

Journals

citations
Cited by 29 publications
(65 citation statements)
references
References 17 publications
0
65
0
Order By: Relevance
“…As a reference topic ontology, we used the Computer Science Ontology (CSO), created to represent topics in the Rexplore system [3], which is currently being trialled by Springer Nature to classify proceedings in the field of Computer Science [17], such as the well-known LNCS series. CSO was created by applying the Klink-2 algorithm [18] to the 16 million publications of our Scopus-derived dataset [3].…”
Section: Input Knowledge Basesmentioning
confidence: 99%
See 1 more Smart Citation
“…As a reference topic ontology, we used the Computer Science Ontology (CSO), created to represent topics in the Rexplore system [3], which is currently being trialled by Springer Nature to classify proceedings in the field of Computer Science [17], such as the well-known LNCS series. CSO was created by applying the Klink-2 algorithm [18] to the 16 million publications of our Scopus-derived dataset [3].…”
Section: Input Knowledge Basesmentioning
confidence: 99%
“…However, as extensively discussed in previous works [3,17,18], these solutions ignore the rich network of semantic relationships between research topics and are often unable to distinguish research areas from other terms that may be used to annotate publications. Therefore, we exploit the topic ontology by associating to each paper i) all the concepts in CSO whose label is found either in the title, the abstract or the keyword set, as well as ii) all skos:broaderGeneric and iii) all relatedEquivalent areas of the initial set of topics extracted from the scholarly dataset.…”
Section: Generation Of Technology-topic Matricesmentioning
confidence: 99%
“…In scenarios where it already exists a taxonomy of research areas [21], it is also possible to use entity linking techniques [7] for mapping documents to related concepts. For example, the Smart Topic Miner [22], an application used by Springer Nature for annotating proceedings books, maps keywords extracted from papers to the automatically generated Klink-2 Computer Science Ontology [20] with the aim of selecting a comprehensive set of structured keywords. The approaches for topic evolution can be distinguished in discriminative and generative [13].…”
Section: Related Workmentioning
confidence: 99%
“…This is particularly challenging in the field of Computer Science, where new areas evolve constantly and taxonomies tend to become obsolete very quickly [22]. In the context of the collaboration between The Open University and Springer Nature [2,13], we focused on the issue of supporting the evolution of the Computer Science portion of PMC, concentrating in particular on some branches that had become obsolete.…”
Section: Motivating Scenario: Evolving Springer Nature Market Codesmentioning
confidence: 99%
“…Indeed, ontologies have proved to be very useful in the context of a variety of tasks [1], including the integration of data from different sources, domain reasoning, classification [2], generation of recommendations [3], cluster analysis [4], community detection [5], sentiment analysis, forecasting [6], and others. Naturally, ontologies need to be regularly maintained and need to evolve according to changes in the domain or new requirements from users or applications [7].…”
Section: Introductionmentioning
confidence: 99%