2019
DOI: 10.1101/797407
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Imitation as a model-free process in human reinforcement learning

Abstract: While there is not doubt that social signals affect human reinforcement learning, there is still no consensus about their exact computational implementation. To address this issue, we compared three hypotheses about the algorithmic implementation of imitation in human reinforcement learning. A first hypothesis, decision biasing, postulates that imitation consists in transiently biasing the learner's action selection without affecting her value function. According to the second hypothesis, model-based imitation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
1
1
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…The computational questions covered in this survey extend beyond the boundaries of Artificial Intelligence, as similar research questions regarding the computational implementation of social learning strategies are also raised in the field of Cognitive Neuroscience [10,87,96]. Thus we think this survey can be of interest for both communities.…”
Section: Discussionmentioning
confidence: 95%
“…The computational questions covered in this survey extend beyond the boundaries of Artificial Intelligence, as similar research questions regarding the computational implementation of social learning strategies are also raised in the field of Cognitive Neuroscience [10,87,96]. Thus we think this survey can be of interest for both communities.…”
Section: Discussionmentioning
confidence: 95%
“…The future direction would be to extend the framework to include more neural populations, representing more complications of the ToM circuits in human brain, and a history of recent events. Given the recent debates on predictive coding and value shaping hypotheses in the context of ToM and social cognition (see 42,43 for instance), it is also of great importance to test if these hypotheses can be explained by an extended ToM-based ImRL framework.…”
Section: Discussionmentioning
confidence: 99%
“…The future direction would be to extend the framework to include more neural populations, representing more complications of the ToM circuits in human brain, and a history of recent events. Given the recent debates on predictive coding and value shaping hypotheses in the context of ToM and social cognition (see 40 and 41 for instance), it is also of great importance to test if these hypotheses can be explained by an extended ToM-based ImRL framework.…”
Section: Discussionmentioning
confidence: 99%
“…As a result, for each group of components, we respectively have a three-dimensional tensor of shape (n, h, d), where n, h, and d respectively denote the number of features (7 for obstacles, 4 for side margins, 3 for oriented margins, and 1 for shot distances), the height of the frame, and the maximum horizontal distance from the plane/shot. All together, there would be four independent input populations: one for the obstacles of shape (7, 101, 175), one for the side margins of shape (4,25,17), one for the oriented margins of shape (3,101,41), and one for the distance of obstacles from the shot of shape (1,101,21). Note that the lowest bar in the game frame is not considered in ToM-based agent's visual access to reduce the complexity of the network's input (see Figure 5).…”
Section: Network Architecturementioning
confidence: 99%