2012
DOI: 10.1162/neco_a_00282
|View full text |Cite
|
Sign up to set email alerts
|

The Successor Representation and Temporal Context

Abstract: The successor representation was introduced into reinforcement learning by Dayan ( 1993 ) as a means of facilitating generalization between states with similar successors. Although reinforcement learning in general has been used extensively as a model of psychological and neural processes, the psychological validity of the successor representation has yet to be explored. An interesting possibility is that the successor representation can be used not only for reinforcement learning but for episodic learning as … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
145
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 123 publications
(149 citation statements)
references
References 48 publications
4
145
0
Order By: Relevance
“…One possibility, borrowing ideas from reinforcement learning, is to directly update an estimate the SR (M̂) from state transitions. Specifically, the SR can be updated incrementally using a form of temporal difference learning [26]:…”
Section: Learning the Successor Representation With Temporal Contextmentioning
confidence: 99%
See 1 more Smart Citation
“…One possibility, borrowing ideas from reinforcement learning, is to directly update an estimate the SR (M̂) from state transitions. Specifically, the SR can be updated incrementally using a form of temporal difference learning [26]:…”
Section: Learning the Successor Representation With Temporal Contextmentioning
confidence: 99%
“…According to TCM, items are bound in memory to a slowly drifting representation of temporal context (a recency-weighted average of previous items), and at test the temporal context acts as a retrieval cue, preferentially drawing items based on their strength of association. It can be shown (see [26] for details) that the temporal context representation corresponds to the eligibility trace e t , and the matrix of item-context associations corresponds to M̂.…”
Section: Learning the Successor Representation With Temporal Contextmentioning
confidence: 99%
“…The SR is essentially identical to the fundamental matrix in the theory of Markov chains (Kemeny & Snell, 1976). More recently, Gershman, Moore, Todd, Norman, and Sederberg (2012) identified a formal connection between the SR and an influential model of episodic and semantic memory, the Temporal Context Model (e.g. Howard & Kahana, 2002; Sederberg, Howard, & Kahana, 2008).…”
Section: Figure A1mentioning
confidence: 99%
“…This idea is embodied in retrieved context models of episodic memory such as the temporal context model of Howard and Kahana (2002a) and its more recent variants (e.g., Sederberg et al, 2008; Polyn et al, 2009; Sederberg, Gershman, Polyn, & Norman, 2011; Howard, Kahana, & Wingfield, 2006; Gershman, Moore, Todd, Norman, & Sederberg, 2012). According to these models, the context cue used for recall of items contains a recency-weighted sum of previously activated cognitive states, and as such predict that the temporal contiguity effect should be enhanced when a sequence of previously recalled items were studied at neighboring list positions.…”
Section: Simulation 1: Context Maintenance and Retrieval Modelmentioning
confidence: 99%