1998
DOI: 10.1007/bfb0026668
|View full text |Cite
|
Sign up to set email alerts
|

Part-of-speech tagging using decision trees

Abstract: { Imarquez, horacio}@ Is i. upc. esA b s t r a c t . We have appfied inductive learning of statistical decision trees to the Natural Language Processing (NLP) task of morphosyntactic disambiguation (Part Of Speech Tagging). Previous work showed that the acquired language models are independent enough to be easily incorporated, as a statistical core of rules, in any flexible tagger. They are also complete enough to be directly used as sets of POS disambiguation rules. We have implemented a quite simple and fast… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0
1

Year Published

1999
1999
2020
2020

Publication Types

Select...
6
2
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 55 publications
(22 citation statements)
references
References 13 publications
0
21
0
1
Order By: Relevance
“…Book and audio book consist of a total number of 40460 tokens (number of words) and 6117 types (number of unique words). The distribution of single word classes and bi-gram word class combinations occurring in the (audio) book were analysed and compared to a number of German reference corpora [59], and in addition, other German novels, by applying part-of-speech (POS) tagging [60][61][62] as implemented in the python library spaCy [63]. The similarities, or dissimilarities respectively, of all distributions are visualized using multi-dimensional scaling (MDS) [64][65][66][67].…”
Section: Speech Stimuli and Natural Language Text Datamentioning
confidence: 99%
“…Book and audio book consist of a total number of 40460 tokens (number of words) and 6117 types (number of unique words). The distribution of single word classes and bi-gram word class combinations occurring in the (audio) book were analysed and compared to a number of German reference corpora [59], and in addition, other German novels, by applying part-of-speech (POS) tagging [60][61][62] as implemented in the python library spaCy [63]. The similarities, or dissimilarities respectively, of all distributions are visualized using multi-dimensional scaling (MDS) [64][65][66][67].…”
Section: Speech Stimuli and Natural Language Text Datamentioning
confidence: 99%
“…In principle, however, it would be possible to produce a complete tagger on the basis of a learned statistical decision tree. Recently, this approach has indeed been explored (Marquez & Rodriguez, 1998). (Rumelhart et al 1986) are the most popular neural network architecture.…”
Section: Discussionmentioning
confidence: 99%
“…Márquetz et al [38] develop decision tree tagger for English POS tagging. They use non-incremental supervised learning from examples of TDIDT (Top Down Induction of Decision Tree) to construct the decision tree.…”
Section: Decision Tree (Dt) Modelmentioning
confidence: 99%