Deep language algorithms predict semantic comprehension from brain activity

Caucheteux, Charlotte; Gramfort, Alexandre; J, King

doi:10.1038/s41598-022-20460-9

Cited by 62 publications

(82 citation statements)

References 67 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this study, we address these issues by analysing the brain signals of 304 individuals listening to short stories while their brain activity is recorded with fMRI 39 . After confirming that deep language algorithms linearly map onto brain activity 6,8,40 , we show that enhancing these models with long-range and multi-level predictions improves such brain mapping. Critically, and in line with predictive coding theory, our results reveal a hierarchical organization of language predictions in the cortex, in which the highest areas predict the most distant and highest-level representations.…”

Section: Isolating Long-range Predictions In the Brainsupporting

confidence: 56%

Evidence of a predictive coding hierarchy in the human brain listening to speech

2023

Self Cite

View full text Add to dashboard Cite

Considerable progress has recently been made in natural language processing: deep learning algorithms are increasingly able to generate, summarize, translate and classify texts. Yet, these language models still fail to match the language abilities of humans. Predictive coding theory offers a tentative explanation to this discrepancy: while language models are optimized to predict nearby words, the human brain would continuously predict a hierarchy of representations that spans multiple timescales. To test this hypothesis, we analysed the functional magnetic resonance imaging brain signals of 304 participants listening to short stories. First, we confirmed that the activations of modern language models linearly map onto the brain responses to speech. Second, we showed that enhancing these algorithms with predictions that span multiple timescales improves this brain mapping. Finally, we showed that these predictions are organized hierarchically: frontoparietal cortices predict higher-level, longer-range and more contextual representations than temporal cortices. Overall, these results strengthen the role of hierarchical predictive coding in language processing and illustrate how the synergy between neuroscience and artificial intelligence can unravel the computational bases of human cognition.

show abstract

Section: Isolating Long-range Predictions In the Brainsupporting

confidence: 56%

Evidence of a predictive coding hierarchy in the human brain listening to speech

2023

Self Cite

View full text Add to dashboard Cite

show abstract

“…The proposed approach is fundamentally different from a purely data-driven one that identifies neural response patterns correlated with pooled activities from hidden layers of a neural network trained on specific tasks of next-input predictions such as in (62, 64, 65). The brain interacts with the external stimuli, whether linguistic or not, in a structured fashion that is likely reused across different domains (44, 58).…”

Section: Discussionmentioning

confidence: 99%

“…Two types of information theoretic metrics have been of particular interest in establishing the connection between abstract information and biophysical signals to probe the brain's information processing capacity: surprisal (related to, but distinct from divergence) and entropy. Efforts in associating neurophysiological responses to surprisal for next-word expectation, either based on cloze probability tests (32,(59)(60)(61) or the probabilistic distribution estimated by computational models (35)(36)(37)(62)(63)(64)(65), largely credit Levy's influential work on expectation-based comprehension (10). Levy proposed a formal relationship between incremental comprehension effort and the Kullback-Leibler divergence (KLD) of syntactic structure inference before and after receiving a word input W, and proved that the KLD reduced to the surprisal of W given the previous word string when conditioned on a constant extrasentential context that constrains comprehension.…”

Section: Understanding Neural Information Transfer Through Divergence...mentioning

confidence: 99%

A deep hierarchy of predictions enables assignment of semantic roles in online speech comprehension

Olasagasti

Giraud

2022

Preprint

View full text Add to dashboard Cite

Understanding speech requires mapping fleeting and often ambiguous soundwaves to meaning. Humans are known to exploit their capacity to contextualize to facilitate this process, but how internal knowledge is used and deployed in real time remains an open question. Existing models of speech processing focus on either word recognition irrespective of meaning or interactions among abstract linguistic representations without time constraints, providing only partial insights into the dynamics of speech comprehension. Here, we present a model that incrementally extracts multiple levels of information from continuous speech signals in real time, based on the inversion of a generative model that represents the listener’s internal knowledge of linguistic and non-linguistic processing levels in a nested temporal hierarchy. In each hierarchy, the model periodically incorporates bottom-up incoming evidence to update its internal representations and generate new top-down predictions. We show that a context level, beyond linguistic representations, can provide the model with semantic predictions informed by sensory inputs, crucial for the disambiguation among multiple meanings of the same word. We also show that hierarchical predictions can reduce peripheral processing effort via minimizing uncertainty and prediction error, especially when sensory precisions become degraded. With this proof-of-concept model we demonstrate that the deployment of hierarchical predictions is a possible strategy for the brain to utilize structured knowledge dynamically for speech comprehension.

show abstract

“…These include predicting the next word in a sequence, utilising contextual information to generate those predictions, and calculating the surprise when predictions are violated. A number of studies employing sentence comprehension paradigms now support the notion that hierarchical representations allowing probabilistic computations operate in both humans and DLNs (27)(28)(29), with transformer-like predictive processing explaining nearly 100% of the explainable variance in neural activity during sentence processing tasks (29). Despite this impressive correspondence with expected brain activity, when generating human-like language, both rule-based and DLN-based NLG results in numerous errors, some of which, as we discuss below, are reminiscent of psychotic symptoms.…”

Section: Natural Language Generationmentioning

confidence: 99%

“…Connectionist models employing neural networks (in this case, DLNs) are particularly appealing in psychosis given the ample evidence implicating disturbances in cognitive operations implemented by distributed brain networks as a core feature of conditions such as schizophrenia (70). In silico or toy models based on neural networks also provide a means to interpret neuroimaging data obtained from human participants (27)(28)(29). While a number of neural network models have been used previously to study psychosis, we see several distinct advantages with DLNs.…”

Section: Factors Contributing To Nlg Errorsmentioning

confidence: 99%

Studying psychosis using Natural Language Generation: A review of emerging opportunities

Palaniyappan¹,

Benrimoh²,

Voppel³

et al. 2023

Preprint

View full text Add to dashboard Cite

Disrupted language in psychotic disorders, such as schizophrenia, can manifest as false contents and formal deviations, often described as thought disorder. These features play a critical role in the social dysfunction associated with psychosis, but we continue to lack insights regarding how these symptoms develop. Natural language Generation (NLG) is a field of computer science that focuses on generating human-like language for various applications. The theory that psychosis is related to the evolution of language in humans suggests that NLG systems that are sufficiently evolved to generate human-like language may also exhibit psychosis-like features. In this conceptual review, we propose using NLG systems that are at various stages of development as in-silico tools to study linguistic features of psychosis. This will allow us to gain a better understanding of the relationship between language and psychosis and potentially pave the way for new therapeutic approaches to address this vexing challenge.

show abstract

Deep language algorithms predict semantic comprehension from brain activity

Cited by 62 publications

References 67 publications

Evidence of a predictive coding hierarchy in the human brain listening to speech

Evidence of a predictive coding hierarchy in the human brain listening to speech

A deep hierarchy of predictions enables assignment of semantic roles in online speech comprehension

Studying psychosis using Natural Language Generation: A review of emerging opportunities

Contact Info

Product

Resources

About