2012
DOI: 10.1075/scl.49.03joh
|View full text |Cite
|
Sign up to set email alerts
|

OBT+stat

Abstract: The paper describes the improvement of the rule-based Constraint Grammar (CG) Oslo-Bergen Tagger (OBT) by the addition of a statistical module. It is in the nature of CG taggers to leave some words ambiguous between different readings, due to a lack of coverage by the linguistics-based rules. Such ambiguities are often a problem for applications that use the tagger, among them the Norwegian Newspaper Corpus. Our statistical module not only removes part of speech (PoS) and morphological ambiguities, but also di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…The other purpose of this annotation is to inform the other tool used to analyze the speeches so that it can be configured correctly. This tool, the Oslo-Bergen Tagger (OBT), annotates text with sentence and token boundaries, lemmas, parts of speech (PoS), and morphological features (Johannessen et al 2012).…”
Section: The Talk Of Norway Data Setmentioning
confidence: 99%
“…The other purpose of this annotation is to inform the other tool used to analyze the speeches so that it can be configured correctly. This tool, the Oslo-Bergen Tagger (OBT), annotates text with sentence and token boundaries, lemmas, parts of speech (PoS), and morphological features (Johannessen et al 2012).…”
Section: The Talk Of Norway Data Setmentioning
confidence: 99%
“…See https://www.hf.uio.no/ilos/english/services/knowledge-resources/omc/sub-corpora/.6 The English texts were tagged with the TreeTagger (www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/) and the Norwegian texts with the Oslo Bergen Tagger(Johannessen et al, 2012).…”
mentioning
confidence: 99%