Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.508
|View full text |Cite
|
Sign up to set email alerts
|

Variation in Coreference Strategies across Genres and Production Media

Abstract: In response to (i) inconclusive results in the literature as to the properties of coreference chains in written versus spoken language, and (ii) a general lack of work on automatic coreference resolution on both spoken language and social media, we undertake a corpus study involving the various genre sections of Ontonotes, the Switchboard corpus, and a corpus of Twitter conversations. Using a set of measures that previously have been applied individually to different data sets, we find fairly clear patterns of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Both s m and s a are computed by the FeedForward Neural Network (FFNN), and φ(x i , y) represents additional meta features. Unlike previous work, we do not include the specific genre as a feature; instead, we simply use a binary feature on whether the document is dialogue-based or article-based, since dialogues can exhibit quite different traits from written articles (Aktaş and Stede, 2020). We also adopt a speaker feature that indicates whether two candidates are from the same speaker, or whether the speaker information is not available, which is important for written articles or two-party dialogues.…”
Section: Mention-ranking (Mr)mentioning
confidence: 99%
See 1 more Smart Citation
“…Both s m and s a are computed by the FeedForward Neural Network (FFNN), and φ(x i , y) represents additional meta features. Unlike previous work, we do not include the specific genre as a feature; instead, we simply use a binary feature on whether the document is dialogue-based or article-based, since dialogues can exhibit quite different traits from written articles (Aktaş and Stede, 2020). We also adopt a speaker feature that indicates whether two candidates are from the same speaker, or whether the speaker information is not available, which is important for written articles or two-party dialogues.…”
Section: Mention-ranking (Mr)mentioning
confidence: 99%
“…Our SE model is further adapted upon SR model and aims to strengthen the speaker encoding for each candidate representation. As we are targeting on the coreference resolution in dialogues, encoding speaker interactions becomes more critical, especially for the correct understanding of the speaker-grounded personal pronouns that are more frequent in dialogues than other non-dialogue genres (Aktaş and Stede, 2020).…”
Section: Speaker Encoding (Se)mentioning
confidence: 99%