Human visual exploration reduces uncertainty about the sensed world

Mirza, M. Berk; Adams, Rick A.; Mathys, Christoph; Friston, Karl J.

doi:10.1371/journal.pone.0190429

Cited by 82 publications

(76 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Because the CRP is exchangeable, it will have the same 34 predictive error for the sequences ABCDABCD and ADBCDBAC. Order-dependent predictability 35 is beyond of the scope of the current work. 36 We evaluated the CRP on X (n) for n = [1, 100] and compared it to a naïve guess (uniform 37 distribution over M).…”

mentioning

confidence: 95%

“…Of course, other decompositions are also possible and useful (and not mutually 599 exclusive). For example, the state-outcome and action-dependent state transition functions of the 600 active inference framework can both be decomposed into "what" and "where" aspects [34,35].…”

mentioning

confidence: 99%

“…The state-action value function is used to generate a policy via an epsilon-greedy exploration 34 rule where the action with the highest valueQ(x = x t , a) was chosen with probability 1 ✏ and a 35 random action was chosen with probability ✏ (ties were broken with equal probability). 36 Simulation 1 37 We first simulated the three agents on the same 150 random tasks presented in Simulation 1 of the 38 manuscript.…”

mentioning

confidence: 99%

See 2 more Smart Citations

Compositional clustering in task structure learning

Franklin

Frank

2017

Preprint

View full text Add to dashboard Cite

Humans are remarkably adept at generalizing knowledge between experiences in a way that can be difficult for computers. Often, this entails generalizing constituent pieces of experiences that do not fully overlap, but nonetheless share useful similarities with, previously acquired knowledge. However, it is often unclear how knowledge gained in one context should generalize to another. Previous computational models and data suggest that rather than learning about each individual context, humans build latent abstract structures and learn to link these structures to arbitrary contexts, facilitating generalization. In these models, task structures that are more popular across contexts are more likely to be revisited in new contexts. However, these models can only re-use policies as a whole and are unable to transfer knowledge about the transition structure of the environment even if only the goal has changed (or vice-versa). This contrasts with ecological settings, where some aspects of task structure, such as the transition function, will be shared between context separately from other aspects, such as the reward function. Here, we develop a novel non-parametric Bayesian agent that forms independent latent clusters for transition and reward functions, affording separable transfer of their constituent parts across contexts. We show that the relative performance of this agent compared to an agent that jointly clusters reward and transition functions depends environmental task statistics: the mutual information between transition and reward functions and the stochasticity of the observations. We formalize our analysis through an information theoretic account of the priors, and propose a meta learning agent that dynamically arbitrates between strategies across task domains to optimize a statistical tradeoff. Author summaryA musician may learn to generalize behaviors across instruments for different purposes, for example, reusing hand motions used when playing classical on the flute to play jazz on the saxophone. Conversely, she may learn to play a single song across many instruments that require completely distinct physical motions, but nonetheless transfer knowledge between them. This degree of compositionality is often absent from computational frameworks of learning, forcing agents either to generalize entire learned policies or to learn new policies from scratch. Here, we propose a solution to this problem that allows an agent to generalize components of a policy independently and compare it to an agent that generalizes components as a whole. We show that the degree to which one form of generalization is favored over the other is dependent on the features of task domain, with independent generalization of task components favored in environments with weak relationships between components or high degrees of noise and joint generalization of task components favored when there is a clear, discoverable relationship between task components.

show abstract

mentioning

confidence: 95%

mentioning

confidence: 99%

mentioning

confidence: 99%

See 1 more Smart Citation

Compositional clustering in task structure learning

Franklin

Frank

2017

Preprint

View full text Add to dashboard Cite

show abstract

“…One key ingredient to this process is the (internal) representation of 144 counterfactual predictions, that is, the probable consequences of possible hypothesis as they 145 would be realized into actions (here, saccades). Following such an active inference scheme 146 numerical simulations reproduce a sequence of eye movements that fit well with empirical 147 data [MAMF18]. As such, saccades are not the output of a value-based cost function such as 148 a saliency map, but are the consequence of an active strategy by the agent to minimize the 149 uncertainty about his beliefs, knowing his priors on the generative model of the visual world.…”

mentioning

confidence: 75%

A dual foveal-peripheral visual processing model implements efficient saccade selection

Daucé

Albiges

Perrinet

2019

Preprint

View full text Add to dashboard Cite

Visual search involves a dual task of localizing and categorizing an object in the visual field of view. We develop a visuo-motor model that implements visual search as a focal accuracyseeking policy, and we assume that the target position and category are random variables which are independently drawn from a common generative process. This independence allows to divide the visual processing in two pathways that respectively infer what to see and where to look, consistently with the anatomical What versus Where separation. We use this dual principle to train a deep neural network architecture with the foveal accuracy used as a monitoring signal for action selection. This allows in particular to interpret the Where network as a retinotopic action selection pathway, that drives the fovea toward the target position in order to increase the recognition accuracy by the What network. After training, the comparison of both networks accuracies amounts either to select a saccade or to keep the eye focused at the center, so as to identify the target. We test this on a simple task of finding digits in a large, cluttered image. A biomimetic log-polar treatment of the visual information implements the strong compression rate performed at the sensor level by retinotopic encoding, and is preserved up to the action selection level. Simulation results demonstrate that it is possible to learn this dual network. After training, this dual approach provides ways to implement visual search in a sub-linear fashion, in contrast with mainstream computer vision. Author summaryThe visual search task consists in extracting a scarce and specific visual information (the 1 "target") from a large and cluttered visual display. In computer vision, this task is usually 2 implemented by scanning all different possible target identities in parallel at all possible 3 spatial positions, hence with strong computational load. The human visual system employs a 4 different strategy, combining a foveated sensor with the capacity to rapidly move the center 5 of fixation using saccades. Then, visual processing is separated in two specialized pathways, 6 the "where" pathway mainly conveying information about target position in peripheral space 7 (independently of its category), and the "what" pathway mainly conveying information about 8

show abstract

“…However, it conflicts with the aim (preference) of avoiding vulnerable self-defensive state outcomes. When the latter aim wins out, heuristically, the stage is set for further epistemic foraging (Mirza et al, 2018).…”

Section: Distributions Of Policy Selections Over 4 Variants 5 Conditmentioning

confidence: 99%

Active inference, stressors, and psychological trauma: A neuroethological model of (mal)adaptive explore-exploit dynamics in ecological context

Linson

Parr

Friston

2019

Preprint

Self Cite

View full text Add to dashboard Cite

This paper offers a formal account of emotional inference and stress-related behaviour, using the notion of active inference. We formulate responses to stressful scenarios in terms of Bayesian belief-updating and subsequent policy selection; namely, planning as (active) inference. Using a minimal model of how creatures or subjects account for their sensations (and subsequent action), we deconstruct the sequences of belief updating and behaviour that underwrite stress-related responses -and simulate the aberrant responses of the sort seen in posttraumatic stress disorder (PTSD). Crucially, the model used for belief-updating generates predictions in multiple (exteroceptive, proprioceptive and interoceptive) modalities, to provide an integrated account of evidence accumulation and multimodal integration that has consequences for both motor and autonomic responses. The ensuing phenomenology speaks to many constructs in the ecological and clinical literature on stress, which we unpack with reference to simulated inference processes and accompanying neuronal responses. A key insight afforded by this formal approach rests on the trade-off between the epistemic affordance of certain cues (that resolve uncertainty about states of affairs in the environment) and the consequences of epistemic foraging (that may be in conflict with the instrumental or pragmatic value of 'fleeing' or 'freezing'). Starting from first principles, we show how this trade-off is nuanced by prior (subpersonal) beliefs about the outcomes of behaviour -beliefs that, when held with unduly high precision, can lead to (Bayes optimal) responses that closely resemble PTSD.

show abstract

Human visual exploration reduces uncertainty about the sensed world

Cited by 82 publications

References 23 publications

Compositional clustering in task structure learning

Compositional clustering in task structure learning

A dual foveal-peripheral visual processing model implements efficient saccade selection

Active inference, stressors, and psychological trauma: A neuroethological model of (mal)adaptive explore-exploit dynamics in ecological context

Contact Info

Product

Resources

About