DOI: 10.1007/978-3-540-85110-3_44
|View full text |Cite
|
Sign up to set email alerts
|

Automated Classification and Categorization of Mathematical Knowledge

Abstract: Abstract.There is a common Mathematics Subject Classification (MSC) System used for categorizing mathematical papers and knowledge. We present results of machine learning of the MSC on full texts of papers in the mathematical digital libraries DML-CZ and NUMDAM. The F1-measure achieved on classification task of top-level MSC categories exceeds 89%. We describe and evaluate our methods for measuring the similarity of papers in the digital library based on paper full texts.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
40
0
2

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 45 publications
(59 citation statements)
references
References 19 publications
1
40
0
2
Order By: Relevance
“…We have developed an MSC classifier (guessed MSC) that is able to assign the top-level MSC for the retro-digitized articles. Our results convincingly demonstrate the feasibility of a machine learning approach to the classification of mathematical papers [25].…”
Section: Mathematical Document Classification and Categorizationsupporting
confidence: 66%
See 2 more Smart Citations
“…We have developed an MSC classifier (guessed MSC) that is able to assign the top-level MSC for the retro-digitized articles. Our results convincingly demonstrate the feasibility of a machine learning approach to the classification of mathematical papers [25].…”
Section: Mathematical Document Classification and Categorizationsupporting
confidence: 66%
“…Terminology evolved and understanding certain expressions requires reading and understanding the whole paper in the context of its time. Help is at hand in the guise of MSC codes suggested by an automated classifier trained via machine learning techniques from a database of articles already classified [39,25].…”
Section: Handling Metadatamentioning
confidence: 99%
See 1 more Smart Citation
“…Perhaps the area of mathematical software with the greatest potential for machine learning applications is Mathematical Knowledge Management (MKM) [12] since many of the tasks are similar to Natural Language Processing (NLP) where machine learning has seen extensive use. For example, [35] describes the automatic identification of a suitable top level from the Mathematics Subject Classification (MSC) system for thousands of articles using an SVM; while [29] describes how NLP techniques were adapted to build a part of speech tagger used for key phrase extraction in the database zbMATH.…”
Section: Mathematical Knowledge Managementmentioning
confidence: 99%
“…Examples of such services include the Citation Service, responsible for citation resolving and indexing or Similarity Service [26,27], which would be able to return similar objects based on a predefined metrics and criteria. Similarly, additional extension services are hoped to be developed in the future by third parties or by the involved partners.…”
Section: Extensionsmentioning
confidence: 99%