A mixture of generative models strategy helps humans generalize across tasks

Castañón, Santiago Herce; Cardoso-Leite, Pedro; Altarelli, Irène; Green, C. Shawn; Schrater, Paul; Bavelier, Daphné

doi:10.1101/2021.02.16.431506

Cited by 3 publications

(4 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Social learning is one of the core human capacities that have been vital in both growth and habitat expansion of human populations 8,9,33 . Previous research has shown that humans flexibly use various social learning strategies in response to the adaptive features of an environment 10,11 . Previous research has also used the exploration-exploitation tradeoff in information search 27,28 as a common platform to clarify computational algorithms underlying social learning.…”

Section: Discussionmentioning

confidence: 99%

“…1 top). If such generalization is warranted (i.e., the old and new environments are structured or generated according to a common rule [10][11][12][13] , decision makers can solve the exploration-exploitation tradeoff in the new environments more efficiently. Although the question of knowledge generalizability has been discussed for many years 14,15 , it has remained largely unanswered because of the computational difficulty of quantifying its cognitive underpinnings in detail.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Insights about the common generative rule underlying an information foraging task can be facilitated via collective search

Naito¹,

Katahira²,

Kameda³

2022

Preprint

View full text Add to dashboard Cite

Social learning is beneficial for efficient information search in unfamiliar environments (“within-task” learning). In the real world, however, possible search spaces are often so large that decision makers are incapable of covering all options, even if they pool their information collectively. One strategy to handle such overload is developing generalizable knowledge that extends to multiple related environments (“across-task” learning). However, it is unknown whether and how social information may facilitate such across-task learning. Here, we investigated participants’ social learning processes across multiple laboratory foraging sessions in spatially correlated reward landscapes that were generated according to a common rule. The results showed that paired participants were able to improve efficiency in information search across sessions more than solo participants. Computational analysis of participants’ choice-behaviors revealed that such improvement across sessions was related to better understanding of the common generative rule. Rule understanding was correlated within a pair, suggesting that social interaction is a key to the improvement of across-task learning.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Insights about the common generative rule underlying an information foraging task can be facilitated via collective search

Naito¹,

Katahira²,

Kameda³

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Studies in the second group deal with the problem of how to use a structured representation for generalization purposes. These studies investigate how humans generalize through property induction (Kemp & Tenenbaum 2009), how they use learned reward functions for generalization during search tasks in spatially or conceptually correlated and graph-structured reward environments (Castañón et al 2021;Wu et al 2018Wu et al , 2020, and how they can learn how to generalize (Austerweil et al 2019).…”

Section: Rule/abstract Learningmentioning

confidence: 99%

Statistical Learning in Vision

Fiser

Lengyel

2022

Annu. Rev. Vis. Sci.

View full text Add to dashboard Cite

Vision and learning have long been considered to be two areas of research linked only distantly. However, recent developments in vision research have changed the conceptual definition of vision from a signal-evaluating process to a goal-oriented interpreting process, and this shift binds learning, together with the resulting internal representations, intimately to vision. In this review, we consider various types of learning (perceptual, statistical, and rule/abstract) associated with vision in the past decades and argue that they represent differently specialized versions of the fundamental learning process, which must be captured in its entirety when applied to complex visual processes. We show why the generalized version of statistical learning can provide the appropriate setup for such a unified treatment of learning in vision, what computational framework best accommodates this kind of statistical learning, and what plausible neural scheme could feasibly implement this framework. Finally, we list the challenges that the field of statistical learning faces in fulfilling the promise of being the right vehicle for advancing our understanding of vision in its entirety. Expected final online publication date for the Annual Review of Vision Science, Volume 8 is September 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

show abstract

“…Animals thrive in a constantly changing environmental demands at many time scales. Biological brains seem capable of using these changes advantageously and leverage the temporal structure to learn causal and well-factorized representations (Collins and Koechlin, 2012;Yu et al, 2021;Herce Castañón et al, 2021). In contrast, traditional neural networks suffer in such settings with sequential experience and display prominent interference between old and new learning limiting most training paradigms to using shuffled data (McCloskey and Cohen, 1989).…”

Section: Introductionmentioning

confidence: 99%

Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representations

Hummos¹

2022

Preprint

View full text Add to dashboard Cite

Animals thrive in a constantly changing environment and leverage the temporal structure to learn well-factorized causal representations. In contrast, traditional neural networks suffer from forgetting in changing environments and many methods have been proposed to limit forgetting with different trade-offs. Inspired by the brain thalamocortical circuit, we introduce a simple algorithm that uses optimization at inference time to generate internal representations of temporal context and to infer current context dynamically, allowing the agent to parse the stream of temporal experience into discrete events and organize learning about them. We show that a network trained on a series of tasks using traditional weight updates can infer tasks dynamically using gradient descent steps in the latent task embedding space (latent updates). We then alternate between the weight updates and the latent updates to arrive at Thalamus, a task-agnostic algorithm capable of discovering disentangled representations in a stream of unlabeled tasks using simple gradient descent. On a continual learning benchmark, it achieves competitive end average accuracy and demonstrates knowledge transfer. After learning a subset of tasks it can generalize to unseen tasks as they become reachable within the well-factorized latent space, through one-shot latent updates. The algorithm meets many of the desiderata of an ideal continually learning agent in open-ended environments, and its simplicity suggests fundamental computations in circuits with abundant feedback control loops such as the thalamocortical circuits in the brain.Preprint. Under review.

show abstract

A mixture of generative models strategy helps humans generalize across tasks

Cited by 3 publications

References 40 publications

Insights about the common generative rule underlying an information foraging task can be facilitated via collective search

Insights about the common generative rule underlying an information foraging task can be facilitated via collective search

Statistical Learning in Vision

Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representations

Contact Info

Product

Resources

About