Proceedings of the 19th International Conference on Computational Linguistics - 2002
DOI: 10.3115/1071884.1071909

The LinGO Redwoods Treebank: Motivation and Preliminary Applications

Abstract: The LinGO Redwoods initiative is a seed activity in the design and development of a new type of treebank. While several medium- to large-scale treebanks exist for English (and for other major languages), pre-existing publicly available resources exhibit the following limitations: (i) annotation is mono-stratal, either encoding topological (phrase structure) or tectogrammatical (dependency) information, (ii) the depth of linguistic information recorded is comparatively shallow, (iii) the design and format of lin…

Citations: cited by 59 publications (72 citation statements)
References: 14 publications
“…We investigate these ideas via experiments in probabilistic parse selection from among a set of alternatives licensed by a hand-built grammar in the context of the newly developed Redwoods HPSG treebank [14]. HPSG (Head-driven Phrase Structure Grammar) is a modern constraint-based lexicalist (unification) grammar, described in [15].…”
Section: Methods (mentioning)
confidence: 99%
“…B&L use head-driven generative parsing strategies from sentential parsing (e.g., Collins 2003) to build SDRSs for the Verbmobil appointment scheduling and travel planning dialogs that make up a large part of the Redwoods Treebank (Oepen et al 2002). An example dialog is that given in Figure 5.…”
Section: Discourse Parsing For Dialog (mentioning)
confidence: 99%
“…The ERG supports both parsing and generation, via the semantic formalism of Minimal Recursion Semantics ("MRS": Copestake et al (2005)). To generate paraphrases with the ERG, we simply parse a given input, select the preferred parse using a pretrained parse selection model (Oepen et al, 2002), and exhaustively generate from the resultant MRS. We then use uniform random sampling to select from the generator outputs, which potentially numbers in the thousands of variants. To handle unknown words during parsing and generation, we use POS mapping and introduce a unique relation for each unknown word, which we use to substitute the unknown word back in to the generation output.…”
Section: Generating Text Noise (mentioning)
confidence: 99%