2020
DOI: 10.1101/2020.11.19.389999
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep neural network models reveal interplay of peripheral coding and stimulus statistics in pitch perception

Abstract: Computations on receptor responses enable behavior in the environment. Behavior is plausibly shaped by both the sensory receptors and the environments for which organisms are optimized, but their roles are often opaque. One classic example is pitch perception, whose properties are commonly linked to peripheral neural coding limits rather than environmental acoustic constraints. We trained artificial neural networks to estimate fundamental frequency from simulated cochlear representations of natural sounds. The… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(21 citation statements)
references
References 136 publications
(373 reference statements)
1
20
0
Order By: Relevance
“…The analogies with the brain thus seem most promising at the level of behavior and representations. Our results add to growing evidence that task-optimized models can produce human-like behavior for signals that are close to the manifold of natural sounds or images 50 , 116 , 117 . However, artificial neural networks also often exhibit substantial representational differences with humans, particularly for unnatural signals derived in various ways from a network 118 – 122 , and our model may exhibit similar divergences.…”
Section: Discussionsupporting
confidence: 71%
“…The analogies with the brain thus seem most promising at the level of behavior and representations. Our results add to growing evidence that task-optimized models can produce human-like behavior for signals that are close to the manifold of natural sounds or images 50 , 116 , 117 . However, artificial neural networks also often exhibit substantial representational differences with humans, particularly for unnatural signals derived in various ways from a network 118 – 122 , and our model may exhibit similar divergences.…”
Section: Discussionsupporting
confidence: 71%
“…This is non-trivial: the dictionary of templates must cover the full range of F 0s, there must be some mechanism to align the templates accurately with the substrate of frequency analysis (e.g., cochlea), and each template itself is a complex affair involving multiple slots with accurate tuning. It has been proposed that templates are learned from exposure to harmonic sounds such as speech ( Terhardt , 1974 ; Divenyi , 1979 ; Bowling & Purves , 2015 ; Saddler et al , 2020 ) possibly modulated by cultural preferences ( McDermott & Hauser , 2004 ; McDermott et al , 2010 , 2016 ; McPherson et al , 2020 ). The demonstration that templates can be learned from noise ( Shamma & Klein , 2000 ; Shamma & Dutta , 2019 ) makes that argument more tenuous, and highlights the question of what, exactly, is being learned.…”
Section: Discussionmentioning
confidence: 99%
“…cochlea), and each template itself is a complex affair involving multiple slots with accurate tuning. It has been proposed that templates are learned from exposure to harmonic sounds such as speech (Terhardt, 1974;Divenyi, 1979;Bowling and Purves, 2015;Saddler et al, 2020) possibly modulated by cultural preferences (Mcdermott and Hauser, 2004;McDermott et al, 2010McDermott et al, , 2016. The demonstration that templates can be learned from noise (Shamma and Klein, 2000;Shamma and Dutta, 2019) makes that argument more tenuous, but it then begs the question as to what is being learned.…”
Section: Discussionmentioning
confidence: 99%