2021
DOI: 10.1609/aaai.v35i6.16627
|View full text |Cite
|
Sign up to set email alerts
|

Planning from Pixels in Atari with Learned Symbolic Representations

Abstract: Width-based planning methods have been shown to yield state-of-the-art performance in the Atari 2600 domain using pixel input. One successful approach, RolloutIW, represents states with the B-PROST boolean feature set. An augmented version of RolloutIW, pi-IW, shows that learned features can be competitive with handcrafted ones for width-based search. In this paper, we leverage variational autoencoders (VAEs) to learn features directly from pixels in a principled manner, and without supervision. The inference … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(11 citation statements)
references
References 28 publications
0
11
0
Order By: Relevance
“…VAE-IW (Dittadi, Drachmann, and Bolander 2021) extends RIW to learn encodings from screen images. In a training stage, the game is run using RIW until a fixed number of screens are encountered and saved.…”
Section: Vae-iwmentioning
confidence: 99%
See 4 more Smart Citations
“…VAE-IW (Dittadi, Drachmann, and Bolander 2021) extends RIW to learn encodings from screen images. In a training stage, the game is run using RIW until a fixed number of screens are encountered and saved.…”
Section: Vae-iwmentioning
confidence: 99%
“…Learning the features from data permits the encoding to be tailored to a specific game, but generating a faithful encoding of a game is non-trivial, and using a static dataset is typically insufficient. For example, Dittadi, Drachmann, and Bolander (2021) save screens that are reached by a Rollout-IW agent using a hand-coded B-PROST feature set, and use those screens to train a Binary-Concrete VAE to produce a game-specific encoding. However, in order for the encoding to be representative of a game, the dataset must include screens from all visually distinct parts, such as separate levels.…”
Section: Online Representation Learning For Atarimentioning
confidence: 99%
See 3 more Smart Citations