How humans effortlessly comprehend speech despite diverse environmental challenges has long intrigued researchers in systems and cognitive neuroscience. This study examines the neural mechanisms underpinning robust speech comprehension, providing computational mechanistic support for the hypothesis that rhythmic, predictive top-down contextualization mediated by the delta rhythm enables time-invariant speech processing. We propose a Brain-Rhythm-Based Inference (BRyBI) model that combines theta-gamma interactions for parsing phoneme sequences with a dynamic delta rhythm that carries inferred prosodic-phrase context, yielding resilient speech representations. As a mechanistic proof of principle, BRyBI replicates human behavioral experiments, handling pitch variations, time-warped speech, interruptions, and silences in non-comprehensible contexts. Notably, the model matches human experiments in exhibiting optimal silence time scales in the theta- and delta-frequency ranges. A comparative analysis with deep neural network language models reveals distinct performance patterns, underscoring the unique capabilities of the rhythmic framework. Overall, our study illuminates the neural underpinnings of speech processing and emphasizes the role of rhythmic brain mechanisms in structured temporal signal processing, an insight that challenges prevailing artificial-intelligence paradigms and points toward more compact and robust computing architectures.