The attentional learning theory of Pearce and Hall (1980) predicts more attention to uncertain cues that have caused a high prediction error in the past. We examined how the cue-elicited pupil dilation during associative learning was linked to such errordriven attentional processes. In three experiments, participants were trained to acquire associations between different cues and their appetitive (Experiment 1), motor (Experiment 2), or aversive (Experiment 3) outcomes. All experiments were designed to examine differences in the processing of continuously reinforced cues (consistently followed by the outcome) versus partially reinforced, uncertain cues (randomly followed by the outcome). We measured the pupil dilation elicited by the cues in anticipation of the outcome and analyzed how this conditioned pupil response changed over the course of learning. In all experiments, changes in pupil size complied with the same basic pattern: During early learning, consistently reinforced cues elicited greater pupil dilation than uncertain, randomly reinforced cues, but this effect gradually reversed to yield a greater pupil dilation for uncertain cues toward the end of learning. The pattern of data accords with the changes in prediction error and error-driven attention formalized by the Pearce-Hall theory.
K E Y W O R D Sassociative learning, attention, human fear conditioning, pupil dilation, reward
| IN TRO DUCT IO NAssociative learning theories assume that repeated observations of the co-occurrence of two external events will lead to the formation of an association that links the representation of both events in memory (Pearce & Bouton, 2001;Rescorla, 1988). However, the association between cues and a significant outcome like reward (or punishment) often is acquired in an uncertain environment, where the contingency between cues and the outcome is not perfect but rather probabilistic. From this perspective, the learning process involves two important variables: (a) the expectancy of the outcome given the cue, and (b) the uncertainty of that expectation. For example, if three different cues were followed by reward on 0%, 50%, and 100% of their past occurrences, these cues differ with respect to reward expectancy and uncertainty. The 0% cue elicits low reward expectancy and low uncertainty. The 100% cue elicits high reward expectancy and low uncertainty. The 50% cue is partially reinforced and elicits an intermediate reward expectancy but high uncertainty whether reward will occur or not. From the perspective of associative learning theories, this uncertainty results from the fact that the 50% cue in the past had been associated with a higher prediction error, whereas in contrast this error in predicting the omission or occurrence of the outcome was small for the 0% cue and the 100% cue.