2014
DOI: 10.1007/978-3-319-08434-3_16
|View full text |Cite
|
Sign up to set email alerts
|

POS Tagging and Its Applications for Mathematics

Abstract: Abstract. Content analysis of scientific publications is a nontrivial task, but a useful and important one for scientific information services. In the Gutenberg era it was a domain of human experts; in the digital age many machine-based methods, e.g., graph analysis tools and machine-learning techniques, have been developed for it. Natural Language Processing (NLP) is a powerful machinelearning approach to semiautomatic speech and language processing, which is also applicable to mathematics. The well establish… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(12 citation statements)
references
References 5 publications
(2 reference statements)
0
12
0
Order By: Relevance
“…Part-of-Speech Tagging (POS Tagging) assigns a tag to each word in a given text [15]. Although the POS Tagging task is mainly a tool for text processing, it can be adjusted to scientific documents with mathematical expressions [29,26]. Therefore, we tag math-related tokens of the text with math specific tags [29].…”
Section: Find Definiens Candidatesmentioning
confidence: 99%
See 1 more Smart Citation
“…Part-of-Speech Tagging (POS Tagging) assigns a tag to each word in a given text [15]. Although the POS Tagging task is mainly a tool for text processing, it can be adjusted to scientific documents with mathematical expressions [29,26]. Therefore, we tag math-related tokens of the text with math specific tags [29].…”
Section: Find Definiens Candidatesmentioning
confidence: 99%
“…Although the POS Tagging task is mainly a tool for text processing, it can be adjusted to scientific documents with mathematical expressions [29,26]. Therefore, we tag math-related tokens of the text with math specific tags [29]. If a math token is only one identifier, an identifier tag is assigned rather that a formula tag.…”
Section: Find Definiens Candidatesmentioning
confidence: 99%
“…In 2014, Schoeneberg et al discussed part-of-speech (POS) Tagging and its applications for mathematics [11]. Their goal was to adapt NLP methods to the special requirements for STEM document content analysis.…”
Section: B Mathematical (Stem) Document Classificationmentioning
confidence: 99%
“…The documents contain about 60 million mathematical formulae, including monomial expressions, e.g., x or t 2 . The disc size of the dataset is about 174 GB uncompressed and is intended to be used for Information Retrieval research tasks, such as Natural Language Processing, text analysis, and mathematical expression tree structure search 11 .…”
Section: A Ntcir Arxiv Datasetmentioning
confidence: 99%
“…Perhaps the area of mathematical software with the greatest potential for machine learning applications is Mathematical Knowledge Management (MKM) [12] since many of the tasks are similar to Natural Language Processing (NLP) where machine learning has seen extensive use. For example, [35] describes the automatic identification of a suitable top level from the Mathematics Subject Classification (MSC) system for thousands of articles using an SVM; while [29] describes how NLP techniques were adapted to build a part of speech tagger used for key phrase extraction in the database zbMATH.…”
Section: Mathematical Knowledge Managementmentioning
confidence: 99%