“…This is non-trivial: the dictionary of templates must cover the full range of F 0s, there must be some mechanism to align the templates accurately with the substrate of frequency analysis (e.g., cochlea), and each template itself is a complex affair involving multiple slots with accurate tuning. It has been proposed that templates are learned from exposure to harmonic sounds such as speech ( Terhardt , 1974 ; Divenyi , 1979 ; Bowling & Purves , 2015 ; Saddler et al , 2020 ) possibly modulated by cultural preferences ( McDermott & Hauser , 2004 ; McDermott et al , 2010 , 2016 ; McPherson et al , 2020 ). The demonstration that templates can be learned from noise ( Shamma & Klein , 2000 ; Shamma & Dutta , 2019 ) makes that argument more tenuous, and highlights the question of what, exactly, is being learned.…”