Deep neural network models reveal interplay of peripheral coding and stimulus statistics in pitch perception

Saddler, Mark R.; Gonzalez, Ray; McDermott, John H

doi:10.1101/2020.11.19.389999

Cited by 17 publications

(21 citation statements)

References 136 publications

(373 reference statements)

“…The analogies with the brain thus seem most promising at the level of behavior and representations. Our results add to growing evidence that task-optimized models can produce human-like behavior for signals that are close to the manifold of natural sounds or images 50, 116, 117. However, artificial neural networks also often exhibit substantial representational differences with humans, particularly for unnatural signals derived in various ways from a network 118–122, and our model may exhibit similar divergences.…”

Section: Discussion (supporting)

confidence: 71%

Deep neural network models of sound localization reveal how perception is adapted to real-world environments

Francl

McDermott

2022

Nat Hum Behav

Self Cite

Mammals localize sounds using information from their two ears. Localization in real-world conditions is challenging, as echoes provide erroneous information, and noises mask parts of target sounds. To better understand real-world localization we equipped a deep neural network with human ears and trained it to localize sounds in a virtual environment. The resulting model localized accurately in realistic conditions with noise and reverberation. In simulated experiments, the model exhibited many features of human spatial hearing: sensitivity to monaural spectral cues and interaural time and level differences, integration across frequency, biases for sound onsets, and limits on localization of concurrent sources. But when trained in unnatural environments without either reverberation, noise, or natural sounds, these performance characteristics deviated from those of humans. The results show how biological hearing is adapted to the challenges of real-world environments and illustrate how artificial neural networks can reveal the real-world constraints that shape perception.

“…This is non-trivial: the dictionary of templates must cover the full range of F0s, there must be some mechanism to align the templates accurately with the substrate of frequency analysis (e.g., cochlea), and each template itself is a complex affair involving multiple slots with accurate tuning. It has been proposed that templates are learned from exposure to harmonic sounds such as speech (Terhardt, 1974; Divenyi, 1979; Bowling & Purves, 2015; Saddler et al., 2020) possibly modulated by cultural preferences (McDermott & Hauser, 2004; McDermott et al., 2010, 2016; McPherson et al., 2020). The demonstration that templates can be learned from noise (Shamma & Klein, 2000; Shamma & Dutta, 2019) makes that argument more tenuous, and highlights the question of what, exactly, is being learned.…”

Section: Discussion (mentioning)

confidence: 99%

Harmonic Cancellation—A Fundamental of Auditory Scene Analysis

Cheveigné

2021

Trends in Hearing

This paper reviews the hypothesis of harmonic cancellation according to which an interfering sound is suppressed or canceled on the basis of its harmonicity (or periodicity in the time domain) for the purpose of Auditory Scene Analysis. It defines the concept, discusses theoretical arguments in its favor, and reviews experimental results that support it, or not. If correct, the hypothesis may draw on time-domain processing of temporally accurate neural representations within the brainstem, as required also by the classic equalization-cancellation model of binaural unmasking. The hypothesis predicts that a target sound corrupted by interference will be easier to hear if the interference is harmonic than inharmonic, all else being equal. This prediction is borne out in a number of behavioral studies, but not all. The paper reviews those results, with the aim to understand the inconsistencies and come up with a reliable conclusion for, or against, the hypothesis of harmonic cancellation within the auditory system.

“…cochlea), and each template itself is a complex affair involving multiple slots with accurate tuning. It has been proposed that templates are learned from exposure to harmonic sounds such as speech (Terhardt, 1974; Divenyi, 1979; Bowling and Purves, 2015; Saddler et al., 2020) possibly modulated by cultural preferences (McDermott and Hauser, 2004; McDermott et al., 2010, 2016). The demonstration that templates can be learned from noise (Shamma and Klein, 2000; Shamma and Dutta, 2019) makes that argument more tenuous, but it then begs the question as to what is being learned.…”

Section: Discussion (mentioning)

confidence: 99%

Harmonic Cancellation - a Fundamental of Auditory Scene Analysis

Cheveigné

2021

Preprint

This paper reviews the hypothesis of harmonic cancellation, according to which an interfering sound is suppressed or canceled on the basis of its harmonicity (or periodicity in the time domain). It defines the concept, discusses theoretical arguments in its favor, and reviews experimental results that support it, or not. If correct, the hypothesis likely draws on time-domain processing of temporally accurate neural representations within the brainstem, as required also by the classic Equalization-Cancellation (EC) model of binaural unmasking. It predicts that a target sound corrupted by interference will be easier to hear if the interference is harmonic than inharmonic, all else being equal. This prediction is borne out in a number of behavioral studies, but not all. The paper reviews those results, with the aim to understand the inconsistencies and come up with a reliable conclusion for, or against, the hypothesis of harmonic cancellation within the auditory system.
