2021
DOI: 10.3389/frma.2020.600382
|View full text |Cite
|
Sign up to set email alerts
|

Large Scale Subject Category Classification of Scholarly Papers With Deep Attentive Neural Networks

Abstract: Subject categories of scholarly papers generally refer to the knowledge domain(s) to which the papers belong, examples being computer science or physics. Subject category classification is a prerequisite for bibliometric studies, organizing scientific publications for domain knowledge extraction, and facilitating faceted searches for digital library search engines. Unfortunately, many academic papers do not have such information as part of their metadata. Most existing methods for solving this task focus on un… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
11
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(14 citation statements)
references
References 41 publications
1
11
0
2
Order By: Relevance
“…While some of the reported indicators, such as F-scores, are relatively low, we think it is instructive to compare our results to those of the recent studies by Kandimalla et al (2020) and Dunham et al (2020). While these authors report better accuracy, it should be highlighted that in this paper we specifically look at the applicability of supervised learning in the context of social sciences.…”
Section: Discussionsupporting
confidence: 54%
See 3 more Smart Citations
“…While some of the reported indicators, such as F-scores, are relatively low, we think it is instructive to compare our results to those of the recent studies by Kandimalla et al (2020) and Dunham et al (2020). While these authors report better accuracy, it should be highlighted that in this paper we specifically look at the applicability of supervised learning in the context of social sciences.…”
Section: Discussionsupporting
confidence: 54%
“…As Kandimalla and colleagues note, this is not an easy task given the large overlap in terminology and the proximity of the categories. Kandimalla et al (2020) have for that reason dropped or collapsed 120 out of 235 of the SC from their data set. In addition, they drop documents assigned to multiple disciplines.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…These methods usually cover one or more of three tasks: i) detection of research topics, ii) scientometric analyses, and iii) prediction of research trends. In the first task, the articles are classified according to a set of research topics, typically using probabilistic topics models or different types of classifiers [8,9,33,12,87]. In the second task, the topics are analysed according to different bibliometrics over time [32,29,22].…”
Section: Introductionmentioning
confidence: 99%