2018
DOI: 10.1609/aaai.v32i1.12081
|View full text |Cite
|
Sign up to set email alerts
|

Generalized Value Iteration Networks:Life Beyond Lattices

Abstract: In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. Furthermore, we present episodic Q-learning, an improvement upon traditional n-ste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 24 publications
(13 citation statements)
references
References 16 publications
0
13
0
Order By: Relevance
“…Value Iteration Networks [54] and Gated Path Planning Networks [37] apply convolutional neural networks (CNN) on discrete maps, then predict the policy with a weighted attention sum over neighborhoods. Generalized Value Iteration Networks [43] and Velickovic et al…”
Section: Related Workmentioning
confidence: 99%
“…Value Iteration Networks [54] and Gated Path Planning Networks [37] apply convolutional neural networks (CNN) on discrete maps, then predict the policy with a weighted attention sum over neighborhoods. Generalized Value Iteration Networks [43] and Velickovic et al…”
Section: Related Workmentioning
confidence: 99%
“…Due to the importance of the planning feature in NNs, various works extended form [24] and proposed new differentiable reward and transition functions that can stabilize learning in the network. For example, the authors in [14] proposes a novel convolution operator to learn and plan on spatial and irregular graphs.…”
Section: Value Iteration Network and Applicationsmentioning
confidence: 99%
“…The Value Iteration Network (VIN) [1] is a CNN architecture to approximate several recursions of VI planning. It has been generalized to arbitrary graphs [2] and multiple levels of hierarchical planning [3]. The instability issues, especially with respect to long planning horizons, due to the recurrent structure are addressed in [4] and [5].…”
Section: Related Workmentioning
confidence: 99%