Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) 2014
DOI: 10.3115/v1/w14-1104
|View full text |Cite
|
Sign up to set email alerts
|

Domain Adaptation with Active Learning for Coreference Resolution

Abstract: In the literature, most prior work on coreference resolution centered on the newswire domain. Although a coreference resolution system trained on the newswire domain performs well on newswire texts, there is a huge performance drop when it is applied to the biomedical domain. In this paper, we present an approach integrating domain adaptation with active learning to adapt coreference resolution from the newswire domain to the biomedical domain. We explore the effect of domain adaptation, active learning, and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…Domain adaptation in coreference resolution has been discussed often, both in the context of multiple text types within standard reference corpora (e.g. conversation, newswire and Web subcorpora in datasets such as the ACE corpus, see Yang et al 2012) or novel domains that are not included in most reference corpora, such as Biomedical NLP (Apostolova et al 2012, Zhao & Ng 2014. Such studies suggest a genre or text type effect for coreference; sentence type effects, by contrast, have not yet been studied.…”
Section: Methodsmentioning
confidence: 99%
“…Domain adaptation in coreference resolution has been discussed often, both in the context of multiple text types within standard reference corpora (e.g. conversation, newswire and Web subcorpora in datasets such as the ACE corpus, see Yang et al 2012) or novel domains that are not included in most reference corpora, such as Biomedical NLP (Apostolova et al 2012, Zhao & Ng 2014. Such studies suggest a genre or text type effect for coreference; sentence type effects, by contrast, have not yet been studied.…”
Section: Methodsmentioning
confidence: 99%
“…Yang et al (2012) adapts a model trained on the MUC-6 and ACE 2005 datasets to the biomedical domain using an active learning approach, applying data augmentation and pruning techniques. Zhao and Ng (2014) propose a feature-based active learning method to learn cross-domain knowledge. Unlike these works, we take advantage of the modern expressive power of the SpanBERT representation.…”
Section: Metricmentioning
confidence: 99%
“…Meanwhile, models for coreference resolution have improved due to neural architectures with millions of parameters and the emergence of pretrained encoders. However, model generalization across domains has always been a challenge (Yang et al, 2012;Zhao and Ng, 2014;Poot and van Cranenburgh, 2020;Aktaş et al, 2020). Since these models are usually engineered for a single dataset, they capture idiosyncrasies inherent in that dataset.…”
Section: Introductionmentioning
confidence: 99%