2020
DOI: 10.1007/978-3-030-51310-8_5

Investigating Query Expansion and Coreference Resolution in Question Answering on BERT

Abstract: The Bidirectional Encoder Representations from Transformers (BERT) model produces state-of-the-art results in many question answering (QA) datasets, including the Stanford Question Answering Dataset (SQuAD). This paper presents a query expansion (QE) method that identifies good terms from input questions, extracts synonyms for the good terms using a widely-used language resource, WordNet, and selects the most relevant synonyms from the list of extracted synonyms. The paper also introduces a novel QE method tha…
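The QE pipeline the abstract outlines (identify good terms, pull synonyms from WordNet, keep only the most relevant ones) can be made concrete with a short sketch. The following uses NLTK's WordNet interface and is illustrative only, not the authors' implementation; in particular, the selection heuristic here (ranking candidate synonyms by path similarity to the term's most frequent sense) is an assumption for demonstration.

```python
# Minimal sketch of WordNet-based query expansion. Assumes NLTK is
# installed and the WordNet corpus has been fetched via
# nltk.download('wordnet'). The ranking heuristic below is illustrative,
# not the paper's exact selection method.
from nltk.corpus import wordnet as wn

def expand_query(question_terms, max_synonyms=2):
    """Return the original terms plus a few WordNet synonyms per term."""
    expanded = list(question_terms)
    for term in question_terms:
        synsets = wn.synsets(term)
        if not synsets:
            continue  # term not covered by WordNet
        base = synsets[0]  # most frequent sense, used as a crude disambiguator
        candidates = set()
        for syn in synsets:
            for lemma in syn.lemmas():
                name = lemma.name().replace('_', ' ')
                if name.lower() != term.lower():
                    candidates.add((name, syn))
        # Rank candidates by how close their synset is to the base sense.
        ranked = sorted(
            candidates,
            key=lambda ns: base.path_similarity(ns[1]) or 0.0,
            reverse=True,
        )
        expanded.extend(name for name, _ in ranked[:max_synonyms])
    return expanded

print(expand_query(["author", "novel"]))
```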

Cited by 18 publications (13 citation statements)
References 20 publications
“…Same-language MT has been successfully used in many NLP applications, e.g. text-to-speech synthesis for creating alternative target sequences (Cahill et al., 2009), translation between varieties of the same language (Brazilian Portuguese to European Portuguese) (Fancellu et al., 2014), paraphrase generation (Plachouras et al., 2018), and producing many alternative sequences of a given input question in question answering (Bhattacharjee et al., 2020). In our case, we developed Portuguese-to-Portuguese MT systems that were able to generate n-best (same-language) alternative sentences of an input Portuguese sentence.…”
Section: Same-language MT (mentioning)
confidence: 99%
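The n-best generation this excerpt describes can be sketched with beam-search decoding in the Hugging Face transformers API. The checkpoint name below is a placeholder, since the cited Portuguese-to-Portuguese systems were trained by the authors and are not assumed to be public; the decoding settings are likewise illustrative.

```python
# Sketch of n-best alternative-sentence generation via beam search with
# the Hugging Face transformers API. MODEL_NAME is hypothetical: the
# cited work used its own Portuguese-to-Portuguese MT systems.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_NAME = "your-org/pt-to-pt-paraphraser"  # hypothetical checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

def nbest_alternatives(sentence, n=5):
    """Return n alternative surface forms of the input sentence."""
    inputs = tokenizer(sentence, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        num_beams=max(n, 5),     # beam width must cover the requested hypotheses
        num_return_sequences=n,  # keep the n best beams
        max_new_tokens=64,
    )
    return [tokenizer.decode(o, skip_special_tokens=True) for o in outputs]
```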
“…dcu (Haque et al., 2020) compared both phrase-based and neural models by extending the STAPLE data with additional corpora (selected for similarity to the task data under a language model), with the neural model performing better. They generate sets of high-scoring predictions according to beam searches, majority voting, and other techniques, and also run these initial translations through an additional paraphrasing model, placing third in the Portuguese track.…”
Section: Baselines (mentioning)
confidence: 99%
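A generic form of the majority-voting step mentioned in this excerpt is easy to sketch: pool hypothesis sets from several systems and keep the candidates proposed by the most systems. This is an illustrative scheme, not the dcu team's exact procedure.

```python
# Illustrative majority voting over candidate outputs from several
# systems (e.g., phrase-based and neural MT runs). Not the dcu system's
# exact procedure.
from collections import Counter

def majority_vote(candidate_lists, top_k=5):
    """candidate_lists: one list of hypothesis strings per system.
    Returns the top_k hypotheses backed by the most systems, with ties
    broken by Counter's insertion order."""
    votes = Counter()
    for candidates in candidate_lists:
        for hyp in set(candidates):  # one vote per system per hypothesis
            votes[hyp] += 1
    return [hyp for hyp, _ in votes.most_common(top_k)]

systems = [
    ["tu gostas de ler", "gostas de ler"],
    ["tu gostas de ler", "voce gosta de ler"],
    ["gostas de ler", "tu gostas de ler"],
]
print(majority_vote(systems, top_k=2))  # ['tu gostas de ler', 'gostas de ler']
```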
“…Information Extraction (IE) plays a fundamental role as a backbone component in many downstream applications. For example, an application such as question answering may be improved by relying on relation extraction (RE) (Hu et al., 2019; Yu et al., 2017), coreference resolution (Bhattacharjee et al., 2020; Gao et al., 2019), named entity recognition (NER) (Molla et al., 2006; Singh et al., 2018), and entity linking (EL) (Broscheit, 2019; Chen et al., 2017) components. This also holds for other applications such as personalized news recommendation (Karimi et al., 2018; Wang et al., 2018, 2019), fact checking (Thorne & Vlachos, 2018; Zhang & Ghorbani, 2020), opinion mining (Sun et al., 2017), semantic search (Cifariello et al., 2019), and conversational agents (Roller et al., 2020).…”
Section: Introduction (mentioning)
confidence: 99%
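As a concrete example of one such IE backbone component, the sketch below runs off-the-shelf NER with spaCy; any of the cited components (RE, coreference, EL) could feed a downstream QA pipeline in the same way. It assumes spaCy and the en_core_web_sm model are installed.

```python
# One IE backbone component (NER) whose output a downstream application
# such as QA could consume. Assumes: pip install spacy &&
# python -m spacy download en_core_web_sm
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("BERT was released by Google in 2018.")

# Entity mentions with their types, e.g. ('Google', 'ORG'), ('2018', 'DATE').
entities = [(ent.text, ent.label_) for ent in doc.ents]
print(entities)
```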
“…Coreference resolution refers to the task of detecting mentions of various entities and events and identifying groups of mentions referring to the same real-world entity or event. It is a fundamental NLP task that has several downstream applications such as question answering (Bhattacharjee et al., 2020), textual entailment (Mitkov et al., 2012), building and maintaining KBs (Angeli et al., 2015; Angell et al., 2021), and multi-document summarization (Falke et al., 2017; Huang and Kurohashi, 2021). Often these downstream applications consume a set of documents, and thus require detection of coreference relations between event and entity mentions spread across documents such as multiple news articles.…”
Section: Introduction (mentioning)
confidence: 99%
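To make the task definition in this excerpt concrete, here is a deliberately naive mention-clustering sketch that groups mentions (possibly across documents) by normalized string match. Real resolvers use learned mention detection and pairwise or higher-order scoring, and would also link pronouns such as "he", which this baseline misses.

```python
# Deliberately naive illustration of the coreference task: cluster
# mentions that refer to the same entity. Grouping by normalized exact
# string match is a weak baseline shown only to fix the task definition.
from collections import defaultdict

def naive_coref_clusters(mentions):
    """mentions: list of (doc_id, mention_text) pairs, possibly spanning
    multiple documents. Returns clusters (lists of mention indices)."""
    clusters = defaultdict(list)
    for i, (_, text) in enumerate(mentions):
        clusters[text.strip().lower()].append(i)
    return [idxs for idxs in clusters.values() if len(idxs) > 1]

mentions = [("d1", "Barack Obama"), ("d1", "he"), ("d2", "barack obama")]
print(naive_coref_clusters(mentions))  # [[0, 2]] -- the pronoun "he" is missed
```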