2010
DOI: 10.1163/9789042027695

A resource-light approach to morpho-syntactic tagging

Abstract: Morpho-syntactic tagging is the process of assigning part of speech (POS), case, number, gender, and other morphological information to each word in a corpus. Morpho-syntactic tagging is an important step in natural language processing. Corpora that have been morphologically tagged are very useful both for linguistic research, e.g. finding instances or frequencies of particular constructions in large corpora, and for further computational processing, such as syntactic parsing, speech recognition, stemming, and…

Cited by 9 publications (10 citation statements)
References 47 publications
“…The MULTEXT-East morphosyntactic specifications currently lack consistency between the languages (Przepiórkowski and Woliński 2003; Derzhanski and Kotsyba 2009; Feldman and Hana 2010), and a typology of the problems is summarised in Rosen (2010). Problematic cases are divided into those where different features in various languages are used to describe the same phenomenon, and, conversely, the same features are used to describe different phenomena.…”
Section: Discussion (mentioning)
confidence: 99%
“…Fine-grained POS (morpho-syntactic or morphological) tagging is the procedure of determining POS, tense, number, gender and other morphological information for every single word in a sentence (Feldman, 2006; Schmid and Laws, 2008). POS tagging is an essential language analysis task in almost all NLP systems, including information extraction, corpus annotation projects, word-sense disambiguation and etc.…”
Section: Introduction (mentioning)
confidence: 99%
“…At the same time, no manually compiled lexicon can cover an arbitrary text. Feldman and Hana (2010) document this on a statistic of Czech nouns obtained from the PDT corpus (Hajič et al. 2000).…”
Section: Introduction (mentioning)
confidence: 99%
“…2008; Oflazer et al. 2001), and manual encoding of basic linguistic facts (e.g., Cucerzan and Yarowsky 2002; Feldman and Hana 2010; Tepper and Xia 2010). Learning from a different language (e.g., Bosch et al.…”
Section: Introduction (mentioning)
confidence: 99%