Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.427
|View full text |Cite
|
Sign up to set email alerts
|

CraftAssist Instruction Parsing: Semantic Parsing for a Voxel-World Assistant

Abstract: We propose a semantic parsing dataset focused on instruction-driven communication with an agent in the game Minecraft 1 . The dataset consists of 7K human utterances and their corresponding parses. Given proper world state, the parses can be interpreted and executed in game. We report the performance of baseline models, and analyze their successes and failures. * Equal contribution † Work done while at Facebook AI Research 1 Minecraft features: c Mojang Synergies AB included courtesy of Mojang AB

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 29 publications
0
13
0
Order By: Relevance
“…Minecraft as an Environment for Grounded Language Understanding substantiated the advantages of building an open interactive assistant in the sandbox construction game of Minecraft instead of a "real world" assistant, which is inherently complex and inherently costly to develop and maintain. The Minecraft world's constraints (e.g., coarse 3-d voxel grid and simple physics) and the regularities in the head of the distribution of in-game tasks allow numerous scenarios for grounded NLU research [Yao et al, 2020, Srinet et al, 2020. Minecraft is an appealing competition domain due to its popularity as a video game, of all games ever released, it has the second-most total copies sold.…”
Section: Competition Typementioning
confidence: 99%
“…Minecraft as an Environment for Grounded Language Understanding substantiated the advantages of building an open interactive assistant in the sandbox construction game of Minecraft instead of a "real world" assistant, which is inherently complex and inherently costly to develop and maintain. The Minecraft world's constraints (e.g., coarse 3-d voxel grid and simple physics) and the regularities in the head of the distribution of in-game tasks allow numerous scenarios for grounded NLU research [Yao et al, 2020, Srinet et al, 2020. Minecraft is an appealing competition domain due to its popularity as a video game, of all games ever released, it has the second-most total copies sold.…”
Section: Competition Typementioning
confidence: 99%
“…The goal of our work is to learn a mapping from natural language instructions to actions. Some general frameworks for mapping instructions to actions include language-conditioned reinforcement learning [11,8,10], semantic parsers learned from supervision [23,55], and a supervised mapping from instruction to action [42]. Tasks that focus on this problem include instruction guided navigation [4,37,42] and cooperative localization [24].…”
Section: Related Workmentioning
confidence: 99%
“…In the general case, we assume that we have access to a parser over a set of core instructions as well as a semantic segmentation and a sequential generation model. We build our virtual agent on top of the CraftAssist framework [22,55]. This software provides the tooling for creating Minecraft sessions and the virtual agent, including the semantic parsing system and semantic segmentation module that make up the core parsing framework.…”
Section: Core Semantic-parsing Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…The droidlet agent described above should be considered an example of how to build a system using the components; but we do not 1 The agent design happily extends to adding audio perception or audio sensory hardware. 2 After handling text spans; see [40] 3 If there are more than one, the agent defaults to the one nearest the human's point/looking-at location (if that is in memory), and otherwise the one nearest the agent Fig. 3.…”
Section: Platformmentioning
confidence: 99%