Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d18-1233

Interpretation of Natural Language Rules in Conversational Machine Reading

Abstract: Most work in machine reading focuses on question answering problems where the answer is directly expressed in the text to read. However, many real-world question answering problems require the reading of text not because it contains the literal answer, but because it contains a recipe to derive an answer together with the reader's background knowledge. One example is the task of interpreting regulations to answer "Can I...?" or "Do I have to...?" questions such as "I am working in Canada. Do I have to carry on…
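To make the setting concrete, here is a minimal illustrative sketch of one such rule-interpretation instance; the field names and example text are assumptions for exposition, not the paper's actual data schema.

```python
# Illustrative shape of a conversational rule-interpretation problem.
# Field names and text are hypothetical, not the real ShARC schema.
example = {
    "rule_text": (
        "You qualify for the benefit if you are over 18 "
        "and you live in the UK."
    ),
    "question": "I'm 25 years old. Can I get the benefit?",
    "dialogue_history": [],
    # The rule does not state the answer literally: the age condition
    # is satisfied by the scenario, but residence is unknown, so the
    # correct move is to ask a clarifying follow-up question.
    "target_output": "Do you live in the UK?",
}
```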

Cited by 121 publications (182 citation statements: 2 supporting, 180 mentioning, 0 contrasting).
References 26 publications.
“…We report the results of our approach, the various baselines, as well as the previous state-of-the-art (SOTA) scores where applicable in Tables 1 and 2 for ShARC and in Table 3 for DailyDialog. On the ShARC dataset, we observe very poor BLEU-4 performance for the encoder-decoder Transformer (E&D), which is consistent with the results of Saeidi et al. (2018), who could not get an LSTM-based network to work without an additional classification head. Adding BERT (E&D+B) slightly improves performance.…”
Section: Results (supporting)
confidence: 84%
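As a rough sketch of how BLEU-4 is typically computed for short generated utterances such as follow-up questions, the snippet below uses NLTK's standard sentence-level implementation; the reference and hypothesis strings are hypothetical, and the cited papers' exact evaluation scripts may differ.

```python
# Sketch: sentence-level BLEU-4 with NLTK. Example strings are
# hypothetical; the papers' actual evaluation pipelines may differ.
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

reference = "do you work for a uk employer ?".split()
hypothesis = "are you working for a uk employer ?".split()

# BLEU-4 uses uniform weights over 1- to 4-gram precisions; smoothing
# prevents a zero score when some higher-order n-gram has no match,
# which is common for single short sentences.
score = sentence_bleu(
    [reference],          # list of tokenized reference sentences
    hypothesis,           # tokenized system output
    weights=(0.25, 0.25, 0.25, 0.25),
    smoothing_function=SmoothingFunction().method1,
)
print(f"BLEU-4: {score:.3f}")
```

Without smoothing, BLEU-4 on a single short question is often exactly zero, which is one reason sentence-level scores for generation models on such data can look very poor.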
“…The CSQA dataset [30] takes preliminary steps towards the sequential KG-QA paradigm, but it is extremely artificial: initial and follow-up questions are generated semi-automatically via templates, and sequential utterances are only simulated by stitching questions with shared entities or relations in a thread, without a logical flow. QBLink [9], CoQA [27], and ShARC [29] are recent resources for sequential QA over text. The SQA resource [16], derived from WikiTableQuestions [25], is aimed at driving conversational QA over (relatively small) Web tables.…”
Section: The ConvQuestions Benchmark, 4.1 Benchmark Creation (mentioning)
confidence: 99%
“…However, such table-cell search methods cannot scale to real-world, large-scale curated KGs. QBLink [9], CoQA [27], and ShARC [29] are recent benchmarks aimed at driving conversational QA over text, and the allied paradigm in text comprehension on interactive QA [18]. Hixon et al. [13] try to learn concept knowledge graphs from conversational dialogues over science questions, but such KGs are fundamentally different from curated ones like Wikidata with millions of facts.…”
Section: Related Work (mentioning)
confidence: 99%
“…The most closely related datasets to ROPES are ShARC (Saeidi et al., 2018), OpenBookQA (Mihaylov et al., 2018), and QuaRel (Tafjord et al., 2019). ShARC shares the same goal of understanding causes and effects (in terms of specified rules), but frames it as a dialogue where the system also has to generate questions to gain complete information.…”
Section: Related Work (mentioning)
confidence: 99%