2018
DOI: 10.1016/j.neuron.2018.03.044
|View full text |Cite
|
Sign up to set email alerts
|

A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy

Abstract: A core goal of auditory neuroscience is to build quantitative models that predict cortical responses to natural sounds. Reasoning that a complete model of auditory cortex must solve ecologically relevant tasks, we optimized hierarchical neural networks for speech and music recognition. The best-performing network contained separate music and speech pathways following early shared processing, potentially replicating human cortical organization. The network performed both tasks as well as humans and exhibited hu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

44
578
3
2

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 471 publications
(658 citation statements)
references
References 72 publications
44
578
3
2
Order By: Relevance
“…It is only at the final, higher-level stages, that the two systems diverge to a specialized system that is finely tuned to facial features, and to a general object system that processes faces and objects similarly, but cannot support face recognition, similar to the primate visual system. A similar divergence at higher layers of processing was recently discovered for auditory stimuli between processing of speech and music in a task-optimized DCNN that generated humanlike performance (14).…”
Section: Discussionsupporting
confidence: 64%
“…It is only at the final, higher-level stages, that the two systems diverge to a specialized system that is finely tuned to facial features, and to a general object system that processes faces and objects similarly, but cannot support face recognition, similar to the primate visual system. A similar divergence at higher layers of processing was recently discovered for auditory stimuli between processing of speech and music in a task-optimized DCNN that generated humanlike performance (14).…”
Section: Discussionsupporting
confidence: 64%
“…First, deep learning models have been used to simulate actual brain mechanisms, such as in vision (Khaligh-Razavi and Kriegeskorte, 2014;Yamins et al, 2014;Eickenberg et al, 2017) and auditory perception (Kell et al, 2018). Second, DNNs have been applied as tools to analyze neuroscience data, including lesion and tumor segmentation (Pinto et al, 2016;Havaei et al, 2017;Kamnitsas et al, 2017b;, anatomical segmentation (Wachinger et al, 2018;X.…”
Section: Introductionmentioning
confidence: 99%
“…1 However, when deep or recurrent neural networks are used to model neurobiological systems, the comparison between model activity and brain activity is often only verified at a coarse resolution, at the level of entire population dynamics 2,3 , or linear combinations of neurons [4][5][6] , and in contexts that are not very different from the contexts that the networks were originally trained in. Thus, the advent of deep learning as a modeling approach in neuroscience raises two more fundamental unanswered questions.…”
mentioning
confidence: 99%