Prefrontal cortex and decision making in a mixed-strategy game

Barraclough, Dominic J.; Conroy, Michelle; Lee, Daeyeol

doi:10.1038/nn1209

Cited by 630 publications

(643 citation statements)

References 38 publications

Supporting

Mentioning

600

Contrasting

Unclassified

“…This crossover interaction suggests that value increases the distance between the response distributions in these two populations of neurons, which should increase the probability of correctly discriminating the orientation of the stimulus (Green and Swets 1966; see also Martinez-Trujillo and Treue 2004). In turn, this increase in discriminability should promote a more rapid accumulation of sensory evidence concerning the identity of valuable stimuli in downstream decision mechanisms, almost as if the physical clarity or distinctiveness of the stimulus was enhanced (Beck et al 2008; Carrasco and McElree 2001; Carrasco et al 2004; Gold and Shadlen 2002, 2007; Navalpakkam and Itti 2007; Newsome et al 1989).…”

Section: Value and Population Responses In Human Visual Cortex (mentioning)

confidence: 99%

“…However, reasonable candidates include dopaminergic (DA) neurons in the ventral tegmental area (VTA) and the substantia nigra pars compacta (SNc) that might influence activity in visual cortex via direct projections to early areas of visual cortex (Berger et al 1988, 1991; Devoto and Flore 2006). However, these projections are generally thought to be sparse, so it is likely that indirect DA signals relayed through the striatum and then to frontal and parietal cortex play an important role in regulating value-related changes in early visual cortex (Barraclough et al 2004; Ding and Hikosaka 2006; Dorris and Glimcher 2004; Gläscher et al 2009; Glimcher 2003; Hikosaka et al 2008; Hollerman and Schultz 1998; Ikeda and Hikosaka 2003; Lau and Glimcher 2007; Leon and Shadlen 1999; Luk and Wallis 2009; Platt and Glimcher 1999; Schultz and Dickinson 2000; Seo et al 2007; Sugrue et al 2004; Wallis and Miller 2003; Watanabe 1996). Indeed, many of the cortical targets of reward signals, such as oculomotor neurons in frontal and parietal cortex, are ideally situated to send modulatory feedback signals to earlier sensory areas so that the cortical representation of high-value stimulus features can be enhanced (Bisley and Goldberg 2003; Ding and Hikosaka 2006; Gold and Shadlen 2007; Serences and Yantis 2006; Shadlen and Newsome 2001).…”

Section: Value and Population Responses In Human Visual Cortex (mentioning)

confidence: 99%

“…Here we show that feature-selective population response profiles in early visual cortex are biased in favor of stimuli deemed more likely to yield a monetary reward based on prior experience. These modulations in the population response profile may in turn increase the quality of inputs into regions of parietal and frontal cortex that integrate sensory evidence to form perceptual decisions (Beck et al 2008; Gold and Shadlen 2002, 2007; Mazurek et al 2003; Roitman and Shadlen 2002; Shadlen and Newsome 2001). Future studies might investigate this relationship by systematically examining the influence of value on the ability of subjects to perform difficult visual discriminations under explicit speed pressure.…”

Section: General Conclusion (mentioning)

confidence: 99%

“…In addition, value modulates feature-selective response profiles associated with both the selected and the unselected stimuli on each trial, suggesting that simple space- and feature-based attention explanations cannot account for the present results. These modulations in response selectivity may facilitate the accumulation of information in the downstream areas that compute decisions about how and when to interact with behaviorally relevant objects in the environment (Beck et al 2008; Gold and Shadlen 2007).…”

Section: Introduction (mentioning)

confidence: 99%

Population Response Profiles in Early Visual Cortex Are Biased in Favor of More Valuable Stimuli

Serences

Saproo

2010

Journal of Neurophysiology

104

Serences JT, Saproo S. Population response profiles in early visual cortex are biased in favor of more valuable stimuli. J Neurophysiol 104: 76-87, 2010. First published April 21, 2010; doi:10.1152/jn.01090.2009. Voluntary and stimulus-driven shifts of attention can modulate the representation of behaviorally relevant stimuli in early areas of visual cortex. In turn, attended items are processed faster and more accurately, facilitating the selection of appropriate behavioral responses. Information processing is also strongly influenced by past experience and recent studies indicate that the learned value of a stimulus can influence relatively late stages of decision making such as the process of selecting a motor response. However, the learned value of a stimulus can also influence the magnitude of cortical responses in early sensory areas such as V1 and S1. These early effects of stimulus value are presumed to improve the quality of sensory representations; however, the nature of these modulations is not clear. They could reflect nonspecific changes in response amplitude associated with changes in general arousal or they could reflect a bias in population responses so that high-value features are represented more robustly. To examine this issue, subjects performed a two-alternative forced choice paradigm with a variable-interval payoff schedule to dynamically manipulate the relative value of two stimuli defined by their orientation (one was rotated clockwise from vertical, the other counterclockwise). Activation levels in visual cortex were monitored using functional MRI and feature-selective voxel tuning functions while subjects performed the behavioral task. The results suggest that value not only modulates the relative amplitude of responses in early areas of human visual cortex, but also sharpens the response profile across the populations of feature-selective neurons that encode the critical stimulus feature (orientation). Moreover, changes in space- or feature-based attention cannot easily explain the results because representations of both the selected and the unselected stimuli underwent a similar feature-selective modulation. This sharpening in the population response profile could theoretically improve the probability of correctly discriminating high-value stimuli from low-value alternatives.

“…manual selections), we have focused on saccadic eye movements, which have been extensively studied with regard to target selection (Schall and Thompson 1999). Our current Markov task, used to demonstrate learning, is most similar to free choice experiments used to demonstrate neural correlates of subjective valuation or reward probability in the lateral intraparietal area (Platt and Glimcher 1999; Sugrue et al 2004; Dorris and Glimcher 2004; Yang and Shadlen 2007) and prefrontal cortex (Barraclough et al 2004). However, Markov chains could be easily adapted to a reaction time task in which the Markov state would determine target probability.…”

Section: Discussion (mentioning)

confidence: 99%

Saccade selection when reward probability is dynamically manipulated using Markov chains

2008

Markov chains (stochastic processes where probabilities are assigned based on the previous outcome) are commonly used to examine the transitions between behavioral states, such as those that occur during foraging or social interactions. However, relatively little is known about how well primates can incorporate knowledge about Markov chains into their behavior. Saccadic eye movements are an example of a simple behavior influenced by information about probability, and thus are good candidates for testing whether subjects can learn Markov chains. In addition, when investigating the influence of probability on saccade target selection, the use of Markov chains could provide an alternative method that avoids confounds present in other task designs. To investigate these possibilities, we evaluated human behavior on a task in which stimulus reward probabilities were assigned using a Markov chain. On each trial, the subject selected one of four identical stimuli by saccade; after selection, feedback indicated the rewarded stimulus. Each session consisted of 200-600 trials, and on some sessions, the reward magnitude varied. On sessions with a uniform reward, subjects (n = 6) learned to select stimuli at a frequency close to reward probability, which is similar to human behavior on matching or probability classification tasks. When informed that a Markov chain assigned reward probabilities, subjects (n = 3) learned to select the greatest reward probability more often, bringing them close to behavior that maximizes reward. On sessions where reward magnitude varied across stimuli, subjects (n = 6) demonstrated preferences for both greater reward probability and greater reward magnitude, resulting in a preference for greater expected value (the product of reward probability and magnitude). These results demonstrate that Markov chains can be used to dynamically assign probabilities that are rapidly exploited by human subjects during saccade target selection.
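The contrast the abstract draws between matching (choosing each stimulus at a frequency close to its reward probability) and maximizing (always choosing the most probable stimulus) can be illustrated with a small simulation. The sketch below is not the authors' task code: the transition matrix `TRANSITIONS`, the function names, and the assumption that the chooser knows the true probabilities are all hypothetical, chosen only to show how a Markov chain can assign reward probabilities from the previous outcome and how expected value is computed as probability times magnitude.

```python
import random

# Hypothetical 4-state Markov chain (not taken from the paper): row i gives the
# probability that each of the four stimuli is rewarded on the next trial,
# given that stimulus i was the rewarded one on the previous trial.
TRANSITIONS = [
    [0.1, 0.6, 0.2, 0.1],
    [0.1, 0.1, 0.6, 0.2],
    [0.2, 0.1, 0.1, 0.6],
    [0.6, 0.2, 0.1, 0.1],
]


def expected_values(probs, magnitudes):
    """Expected value of each stimulus: reward probability times magnitude."""
    return [p * m for p, m in zip(probs, magnitudes)]


def matching(probs, rng):
    """Pick each stimulus with a frequency equal to its reward probability."""
    return rng.choices(range(len(probs)), weights=probs)[0]


def maximizing(probs, rng):
    """Always pick the stimulus with the greatest reward probability."""
    return max(range(len(probs)), key=lambda i: probs[i])


def simulate(policy, n_trials=600, seed=0):
    """Return the fraction of trials on which the chosen stimulus was rewarded.

    Assumes an idealized chooser that already knows the true probabilities.
    """
    rng = random.Random(seed)
    state = 0  # index of the stimulus rewarded on the previous trial
    hits = 0
    for _ in range(n_trials):
        probs = TRANSITIONS[state]
        choice = policy(probs, rng)
        rewarded = rng.choices(range(len(probs)), weights=probs)[0]
        hits += choice == rewarded
        state = rewarded  # the outcome sets the probabilities for the next trial
    return hits / n_trials


if __name__ == "__main__":
    print("matching:  ", simulate(matching, n_trials=20000))
    print("maximizing:", simulate(maximizing, n_trials=20000))
```

With these made-up probabilities, a matching chooser is rewarded on roughly the sum of squared probabilities per trial (0.42 for each row), while a maximizing chooser is rewarded on roughly 0.6 of trials, which is why maximizing yields more reward than matching in this toy setting.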
