2021
DOI: 10.48550/arxiv.2108.07499
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Annotation Guidelines for the Turku Paraphrase Corpus

Abstract: This document describes the annotation guidelines used to construct the Turku Paraphrase Corpus. These guidelines were developed together with the corpus annotation, revising and extending the guidelines regularly during the annotation work. Our paraphrase annotation scheme uses the base scale 1-4, where labels 1 and 2 are used for negative candidates (not paraphrases), while labels 3 and 4 are paraphrases at least in the given context if not everywhere. In addition to base labeling, the scheme is enriched wit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…These flags are independent of each other and thus one label 4 paraphrase pair can have multiple flags, disregarding the directional subsumption flags. More detailed description of the labels together with example annotations is given in the annotation guidelines (Kanerva et al 2021a).…”
Section: Methodsmentioning
confidence: 99%
“…These flags are independent of each other and thus one label 4 paraphrase pair can have multiple flags, disregarding the directional subsumption flags. More detailed description of the labels together with example annotations is given in the annotation guidelines (Kanerva et al 2021a).…”
Section: Methodsmentioning
confidence: 99%
“…A total of 17page annotation manual was produced in collaboration among the annotators, and the guidelines were revised and extended regularly to account for new problematic cases. The full manual is published as a technical report (Kanerva et al 2021a), and some of the most interesting/relevant policies are discussed below.…”
Section: Annotation Guidelinesmentioning
confidence: 99%