Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.54
|View full text |Cite
|
Sign up to set email alerts
|

Dialogue in the Wild: Learning from a Deployed Role-Playing Game with Humans and Bots

Abstract: Much of NLP research has focused on crowdsourced static datasets and the supervised learning paradigm of training once and then evaluating test performance. As argued in de Vries et al. (2020), crowdsourced data has the issues of lack of naturalness and relevance to real-world use cases, while the static dataset paradigm does not allow for a model to learn from its experiences of using language (Silver et al., 2013). In contrast, one might hope for machine learning systems that become more useful as they inter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…In the following experiments, we focus on the textbased open-world adventure game dialogue setting of LIGHT (Urbanek et al, 2019). More specifically, we consider LightWild (Shuster et al, 2021b), a dataset of more than 40k episodes which are not specifically knowledge grounded, but require commonsense reasoning and attention to detail of the context instead. Hence, we do not consider retrieval-augmented models for this task.…”
Section: Lightmentioning
confidence: 99%
“…In the following experiments, we focus on the textbased open-world adventure game dialogue setting of LIGHT (Urbanek et al, 2019). More specifically, we consider LightWild (Shuster et al, 2021b), a dataset of more than 40k episodes which are not specifically knowledge grounded, but require commonsense reasoning and attention to detail of the context instead. Hence, we do not consider retrieval-augmented models for this task.…”
Section: Lightmentioning
confidence: 99%
“…An alternative approach is to deploy a system publicly, and collect interaction data and feedback from organic users directly. The promise of such an approach is that the distribution of data will more closely match those organic users' desires, rather than decided by the researchers themselves when creating datasets (Gabriel et al, 2020;Shuster et al, 2021b;Ouyang et al, 2022). Further, continued deployment of such a system, with appropriate learning systems, could then potentially keep improving over time (Carlson et al, 2010;Agichtein et al, 2006;Madotto et al, 2021;Shuster et al, 2021b), where (Hancock et al, 2019) refer to this approach as a self-feeding chatbot.…”
Section: Related Workmentioning
confidence: 99%
“…The promise of such an approach is that the distribution of data will more closely match those organic users' desires, rather than decided by the researchers themselves when creating datasets (Gabriel et al, 2020;Shuster et al, 2021b;Ouyang et al, 2022). Further, continued deployment of such a system, with appropriate learning systems, could then potentially keep improving over time (Carlson et al, 2010;Agichtein et al, 2006;Madotto et al, 2021;Shuster et al, 2021b), where (Hancock et al, 2019) refer to this approach as a self-feeding chatbot. The challenge, however, is that organic users may not be invested enough to want to provide adequate feedback, and some may be adversarial (Park et al, 2021) as in the case of Microsoft's Tay (Davis, 2016).…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations