Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020
DOI: 10.18653/v1/2020.emnlp-main.655
|View full text |Cite
|
Sign up to set email alerts
|

Information Seeking in the Spirit of Learning: A Dataset for Conversational Curiosity

Abstract: Open-ended human learning and informationseeking are increasingly mediated by digital assistants. However, such systems often ignore the user's pre-existing knowledge. Assuming a correlation between engagement and user responses such as "liking" messages or asking followup questions, we design a Wizard-of-Oz dialog task that tests the hypothesis that engagement increases when users are presented with facts related to what they know. Through crowd-sourcing of this experiment, we collect and release 14K dialogs … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…For both training and pretraining we use four different datasets: Curiosity (Rodriguez et al 2020) is a collection of conversations on a specific topic with separate knowledge (memory) from which they are based. Dialogue-based REAding comprehension exaMination (DREAM) (Sun et al 2019) is a dataset that is aimed at text comprehension, we adopt the text as our memory and the question as input to the system.…”
Section: Datasetsmentioning
confidence: 99%
“…For both training and pretraining we use four different datasets: Curiosity (Rodriguez et al 2020) is a collection of conversations on a specific topic with separate knowledge (memory) from which they are based. Dialogue-based REAding comprehension exaMination (DREAM) (Sun et al 2019) is a dataset that is aimed at text comprehension, we adopt the text as our memory and the question as input to the system.…”
Section: Datasetsmentioning
confidence: 99%
“…Knowledge-grounded social chat systems (Ghazvininejad et al, 2018;Dinan et al, 2019;Zhou et al, 2018;Moghe et al, 2018) incorporate external knowledge with the purpose of making the conversations more engaging and informative. Rodriguez et al (2020) collect a dataset for training a conversational agent to select knowledge to present based on the user's background, with the aim to maintain the user's interest in the conversation.…”
Section: Related Workmentioning
confidence: 99%
“…The datasets to train models in CIS are designed to facilitate a mixedinitiative dialogue, where the agent can also ask clarifying or follow-up questions to gather information concerning the IS query (Zhang and Bansal 2019). Alongside, datasets like ShARC help CIS models to improve response generation (Saeidi et al 2018). However, in this study, we restrict ourselves to solving an open sub-problem in CIS that generates ISQs.…”
Section: Related Workmentioning
confidence: 99%
“…Both QAD and QAMR consist of D only IS queries. Facebook Curiosity (FBC) (Rodriguez et al 2020) is another dataset that challenges ISEEQ to have both semantic relations and logical coherence. This is because queries are described in the form of Tp & Asp.…”
Section: Datasetsmentioning
confidence: 99%