2015
DOI: 10.2139/ssrn.2709072
|View full text |Cite
|
Sign up to set email alerts
|

Pre-Experiments on Annotation of Russian Coreference Corpus

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…Missing/redundant markable (about 20% and 28% respectively) is the case when an annotator missed one or several mentions, although the coreference cluster is there in both annotation versions. For these cases we examined types of NPs missed by one annotator in the annotation from scratch stage, having preserved the taxonomy as in (Toldova et al, 2015) in order to compare them. See Table 2 to check numbers.…”
Section: Disagreement Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Missing/redundant markable (about 20% and 28% respectively) is the case when an annotator missed one or several mentions, although the coreference cluster is there in both annotation versions. For these cases we examined types of NPs missed by one annotator in the annotation from scratch stage, having preserved the taxonomy as in (Toldova et al, 2015) in order to compare them. See Table 2 to check numbers.…”
Section: Disagreement Analysismentioning
confidence: 99%
“…Thus, most of the largest corpora with coreference annotation contain predominantly English texts; however, with the growing interest in natural language processing of Non-English languages, corpora in other languages are being developed more often. As for the Russian language, there now exist two such datasets, one of them being RuCor (Toldova et al, 2014;Toldova et al, 2015) and the other AnCor (Budnikov et al, 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Our experiments were conducted on RuCoref, a corpus of Russian texts with coreference annotation 2 released during RU-EVAL evaluation forum ( [36]). This corpus consists of short texts in a variety of genres: news, scientific articles, blog posts and fiction.…”
Section: Datamentioning
confidence: 99%
“…Methods of anaphora resolution (see, e.g. [7]) could not help find different names for main characters.…”
Section: Introductionmentioning
confidence: 99%