2022
DOI: 10.48550/arxiv.2203.12686
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Possibility Before Utility: Learning And Using Hierarchical Affordances

Abstract: Reinforcement learning algorithms struggle on tasks with complex hierarchical dependency structures. Humans and other intelligent agents do not waste time assessing the utility of every high-level action in existence, but instead only consider ones they deem possible in the first place. By focusing only on what is feasible, or "afforded", at the present moment, an agent can spend more time both evaluating the utility of and acting on what matters. To this end, we present Hierarchical Affordance Learning (HAL),… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…The purpose of this experiment is to validate the effectiveness of our algorithm in addressing the problem of decision difficulty caused by skill redundancy in long-sequence composite tasks. To achieve this, we plan to compare our algorithm with two prominent methods in hierarchical reinforcement learning: the classic Option-Critic algorithm and the HAL algorithm 16 , which has shown good performance in sparse reward tasks.…”
Section: Comparison Experimentsmentioning
confidence: 99%
“…The purpose of this experiment is to validate the effectiveness of our algorithm in addressing the problem of decision difficulty caused by skill redundancy in long-sequence composite tasks. To achieve this, we plan to compare our algorithm with two prominent methods in hierarchical reinforcement learning: the classic Option-Critic algorithm and the HAL algorithm 16 , which has shown good performance in sparse reward tasks.…”
Section: Comparison Experimentsmentioning
confidence: 99%