2019 Fifth Indian Control Conference (ICC) 2019
DOI: 10.1109/indiancc.2019.8715603
|View full text |Cite
|
Sign up to set email alerts
|

Neural-Linear Architectures for Sequential Decision Making

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 0 publications
0
10
0
Order By: Relevance
“…In certain domains, the number of actions available to an agent is large, which can greatly affect scalability. Previous work in RL has considered decomposing actions into independent sub-actions [49], generalizing across similar actions by embedding them in a continuous space [50], or learning which actions to eliminate via supervision provided by the environment [51]. Existing approaches in planning consider progressively widening the search based on a heuristic [52] or learning a partial policy for eliminating actions in the search tree [53].…”
Section: Proposed Methodsmentioning
confidence: 99%
“…In certain domains, the number of actions available to an agent is large, which can greatly affect scalability. Previous work in RL has considered decomposing actions into independent sub-actions [49], generalizing across similar actions by embedding them in a continuous space [50], or learning which actions to eliminate via supervision provided by the environment [51]. Existing approaches in planning consider progressively widening the search based on a heuristic [52] or learning a partial policy for eliminating actions in the search tree [53].…”
Section: Proposed Methodsmentioning
confidence: 99%
“…However, these approaches require the assumption that actions have dense semantic information, consist of natural language, and cannot be applied to general high-dimensional tasks. Some works propose solutions for generic large-scale action spaces, such as dividing the action space by using multiple hierarchical policies similar to a tree structure to reduce the action dimension of each layer of the policy [10,14,96]; or gradually increasing the action space employing curriculum learning so that the policy only needs to be optimized in a smaller action space in the early stage [20].…”
Section: A2 Structured or Large-scale Actionsmentioning
confidence: 99%
“…In addition, the template-based action space is introduced where the agent selects first a template, and then a verb-object pair either individually (Hausknecht et al, 2020) or conditioned on the selected template (Ammanabrolu and Hausknecht, 2020). Even using the reduced action space, approaches filtering unnecessary actions can further improve the computational tractability and speed up the learning convergence (Zahavy et al, 2018;Jain et al, 2020).…”
Section: Combinatorial Action Space In Tbgsmentioning
confidence: 99%
“…For example, some works consider a set of currently admissible actions (He et al, 2016), or a template-based action space (Hausknecht et al, 2020). Alternatively, some other works alleviated this challenge by filtering inadmissible actions through methods such as action affordance (Jain et al, 2020), bandit-based elimination (Zahavy et al, 2018) and rule-based scoring (Ammanabrolu and Riedl, 2019).…”
Section: Introductionmentioning
confidence: 99%