2018
DOI: 10.1101/296335
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Better than maximum likelihood estimation of model-based and model-free learning style

Abstract: Multiple decision making systems work together to shape the final choices in human behavior.Habitual and goal-directed systems are the two most important systems that are studied in the reinforcement learning (RL) literature by model-free and model-based learning methods. Human behavior resembles the weighted combination of these systems and such a combination is modeled by weighted summation of action`s value from the model based and model free systems. Extraction of this weighted parameter, which is importan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 35 publications
0
2
0
Order By: Relevance
“…We created 200 simulation data points using a model with the following parameters: α , 0.1 ± 0.05 (mean ± SD); β , 1 ± 0.2; ν + and ν − , randomly selected within 0.01–0.95. Parameter estimation was quite accurate 39 (Supplementary Figure 1). Specifically, Pearson’s r and the mean absolute error between the true and estimated ν + or ν − were 0.99 and 0.03, respectively (Supplementary Figure 1).…”
Section: Methodsmentioning
confidence: 97%
See 1 more Smart Citation
“…We created 200 simulation data points using a model with the following parameters: α , 0.1 ± 0.05 (mean ± SD); β , 1 ± 0.2; ν + and ν − , randomly selected within 0.01–0.95. Parameter estimation was quite accurate 39 (Supplementary Figure 1). Specifically, Pearson’s r and the mean absolute error between the true and estimated ν + or ν − were 0.99 and 0.03, respectively (Supplementary Figure 1).…”
Section: Methodsmentioning
confidence: 97%
“…We created 200 simulation data points using a model with the following parameters: 𝛼, 0.1  0.05 (mean  SD); 𝛽, 1  0.2; 𝜈 + and 𝜈 − , randomly selected within 0.01-0.95. Parameter estimation was quite accurate 39 (Supplementary Figure 1).…”
Section: Model Validation and Parameter Estimation For The Delayed Feedback Taskmentioning
confidence: 97%