14The neurotransmitter dopamine is implicated in diverse functions, including reward 15 processing, reinforcement learning and cognitive control. The tendency to discount future 16 rewards in value over time has long been discussed in the context of potential dopaminergic 17 modulation. Here we examined the effect of a single dose of the D2 receptor antagonist 18 Haloperidol (2mg) on temporal discounting. Our approach extends previous human 19 pharmacological studies in two ways. First, we applied state-of-the-art computational 20 modeling based on the drift diffusion model to comprehensively examine choice dynamics.
21Second, we examined dopaminergic modulation of reward magnitude effects on temporal 22 discounting. Drift diffusion modeling revealed reduced temporal discounting and substantially 23 faster non-decision times under Haloperidol. Temporal discounting was substantially 24 increased for low vs. high reward magnitudes, but this magnitude effect was largely 25 unaffected by Haloperidol. These results were corroborated by model-free analyses as well as 26 modeling via more standard approaches using softmax action selection. We previously 27 reported elevated caudate activation under Haloperidol in this sample of participants, 28 supporting the idea that Haloperidol elevated dopamine neurotransmission, e.g. by blocking 29 inhibitory feedback via presynaptic D2 autoreceptors. The present modeling results show that 30 during inter-temporal choice, this leads to attenuated temporal discounting and increased 31 response vigor (shorter non-decision times).32 33planning is thought to rely on anticipatory mechanisms that compute future outcomes based 68 on an environmental state-space, whereas such explicit forward planning is not required for 69 model-free RL. While one study found no association between model-based RL and temporal 70 discounting 19 , another large-scale online study observed the predicted positive association: 71 participants with a stronger contribution of model-based planning to RL also showed reduced 72 temporal discounting 20 . However, in contrast to temporal discounting, where (as mentioned 73 above) elevation of DA levels via the catecholamine precursor L-DOPA lead to more 74 impulsive decision-making and steeper discounting 13 (though results in a larger sample were 75 considerably more mixed 14 ), L-DOPA instead increased reliance on model-based RL in 76 controls 21 and Parkinson's disease patients 22 . However, a recent study in a substantially larger 77 sample (n=65) did not replicate this overall effect of L-DOPA on model-based control 23 .
78Rather, in this study the effect of L-DOPA was restricted to a subset of participants with high 79 working memory capacity.
80Several task-related and contextual factors affect the degree of temporal 81 discounting 1,24 . One well-replicated behavioral effect, the magnitude effect, refers to the 82 observation that the rate of temporal discounting decreases with increasing reward 83 magnitude 25 . In humans, this effect depends on lateral p...