2018
DOI: 10.1101/499418
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Discovery of Hierarchical Representations for Efficient Planning

Abstract: SummaryWe propose that humans spontaneously organize environments into clusters of states that support hierarchical planning, enabling them to tackle challenging problems by breaking them down into sub-problems at various levels of abstraction. People constantly rely on such hierarchical presentations to accomplish tasks big and small – from planning one’s day, to organizing a wedding, to getting a PhD – often succeeding on the very first attempt. We formalize a Bayesian model … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
4

Relationship

5
3

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 116 publications
(148 reference statements)
0
12
0
Order By: Relevance
“…Over the course of learning, the model adaptively compresses the policy so that it achieves the highest reward rate subject to a constraint on the average number of bits used to specify the policy. The implications of adaptive policy compression are wide-reaching: in addition to explaining quantitative aspects of choice perseveration (Gershman, 2020), it may also provide a normative explanation for different forms of action and state chunking observed experimentally (e.g., Dezfouli and Balleine, 2012, Tomov et al, 2020). Finally, the cost-sensitive actor-critic model suggests a computational rationale for the massive compression factor in the mapping from cortex to striatum (Bar-Gad et al, 2003).…”
Section: Discussionmentioning
confidence: 99%
“…Over the course of learning, the model adaptively compresses the policy so that it achieves the highest reward rate subject to a constraint on the average number of bits used to specify the policy. The implications of adaptive policy compression are wide-reaching: in addition to explaining quantitative aspects of choice perseveration (Gershman, 2020), it may also provide a normative explanation for different forms of action and state chunking observed experimentally (e.g., Dezfouli and Balleine, 2012, Tomov et al, 2020). Finally, the cost-sensitive actor-critic model suggests a computational rationale for the massive compression factor in the mapping from cortex to striatum (Bar-Gad et al, 2003).…”
Section: Discussionmentioning
confidence: 99%
“…This work highlights the fact that many sequential decision problems contain repeated sets of actions that are related to solving the same subgoals. At a high level, many of the computational approaches for learning which subgoals are useful for a given task involve identifying commonly traversed task states that serve as bottlenecks between large sets of similar states (Hengst, 2012; Tomov, Yagati, Kumar, Yang, & Gershman, 2020). For example, a doorway between two rooms represents a bottleneck linking any state in the first room to any state in the second room.…”
Section: Task Representationsmentioning
confidence: 99%
“…When combined with the GP framework, it allows us to make Bayesian predictions about unobserved nodes. Even though previous work has investigated how people learn the relational structure of a graph (Kemp & Tenenbaum, 2008;Kemp, Tenenbaum, Griffiths, Yamada, & Ueda, 2006;Tomov, Yagati, Kumar, Yang, & Gershman, 2018), or infer properties of unobserved inputs (Kemp & Tenenbaum, 2009;Kemp, Shafto, & Tenenbaum, 2012), less is known about how people learn functions in discrete spaces with real-valued outputs.…”
Section: Goals and Scopementioning
confidence: 99%
“…Previous work has also investigated how people perform inference over graphs (Kemp & Tenenbaum, 2009, 2008Shafto, Kemp, Baraff, Coley, & Tenenbaum, 2005;Tomov et al, 2018). Whereas these studies were geared towards probing how people inferred underlying structure (Kemp & Tenenbaum, 2008) and how (implicit or explicit) represen-tations of structure influenced causal property judgments (Kemp & Tenenbaum, 2009;Shafto et al, 2005), the goal of our Subway Prediction Task was to study how people perform functional inference given explicit knowledge of a relational structure.…”
Section: Related Workmentioning
confidence: 99%