2021
DOI: 10.1101/2021.07.20.453122
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hierarchical clustering optimizes the tradeoff between compositionality and expressivity of task structures for flexible reinforcement learning

Abstract: A hallmark of human intelligence is our ability to compositionally generalise: that is, to recompose familiar knowledge components in novel ways to solve new problems. For instance, a talented musician can conceivably transfer her knowledge of flute fingerings and guitar songs to play guitar music on a piccolo for the first time. Yet there are also instances where it can be helpful to learn and transfer not just individual task components, but entire structures or substructures, particularly whenever these rec… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 54 publications
0
2
0
Order By: Relevance
“…For example, 'adding salt' can be a subtask that starts upon tasting bland food, continues with a policy that includes reaching for the salt shaker, grasping it, and shaking it over the food, and ends when the subgoal is reached: there is salt on the food. Subtasks can be used across tasks 89,96,97 (e.g., 'adding salt' is used by 'dining at a restaurant' and 'eating at home'). The term "subgoal" distinguishes the termination state of the subtask (food is salted) from the overall goal of the task (having a full stomach).…”
Section: Schema Hierarchies Might Be Learned and Instantiated Via Hie...mentioning
confidence: 99%
“…For example, 'adding salt' can be a subtask that starts upon tasting bland food, continues with a policy that includes reaching for the salt shaker, grasping it, and shaking it over the food, and ends when the subgoal is reached: there is salt on the food. Subtasks can be used across tasks 89,96,97 (e.g., 'adding salt' is used by 'dining at a restaurant' and 'eating at home'). The term "subgoal" distinguishes the termination state of the subtask (food is salted) from the overall goal of the task (having a full stomach).…”
Section: Schema Hierarchies Might Be Learned and Instantiated Via Hie...mentioning
confidence: 99%
“…As we increasingly confront a bombardment of moral issues on the national and international stage, from how we deal with an emerging climate crisis to social justice issues such as racial reparations, it has never been more important to understand the ways in which human moral judgments systematically emerge and are updated from the social inputs around us. Although social computational models of learning are still in their infancy, formal models of learning are becoming increasingly sophisticated in their ability to capture the building blocks of human learning and reasoning that are relevant to the moral domain, such as concept abstraction (Gershman et al, 2015;R. G. Liu & Frank, 2021;Tenenbaum et al, 2011) and mental simulation (Ho et al, 2022).…”
Section: Bayesian Models In Moral Contextsmentioning
confidence: 99%