2016
DOI: 10.1101/074799
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Relating dynamic brain states to dynamic machine states: human and machine solutions to the speech recognition problem

Abstract: There is widespread interest in the relationship between the neurobiological systems supporting human cognition and emerging computational systems capable of emulating these capacities. Human speech comprehension, poorly understood as a neurobiological process, is an important case in point. Automatic Speech Recognition (ASR) systems with nearhuman levels of performance are now available, which provide a computationally explicit solution for the recognition of words in continuous speech. This research aims to … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
5
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(6 citation statements)
references
References 60 publications
1
5
0
Order By: Relevance
“…Our findings imply that developing appropriate intermediate representations for articulatory features may be central to speech recognition in both human and machine solutions. In human neuroscience studies, this account is consistent with previous findings of articulatory feature representation in the human auditory cortex [13,37,63], but awaits further investigation and exploitation in machine solutions for speech recognition. Recently, large deep artificial neural network models pre-trained on a massive amount of unlabelled waveform features (e.g.…”
Section: Discussionsupporting
confidence: 86%
See 4 more Smart Citations
“…Our findings imply that developing appropriate intermediate representations for articulatory features may be central to speech recognition in both human and machine solutions. In human neuroscience studies, this account is consistent with previous findings of articulatory feature representation in the human auditory cortex [13,37,63], but awaits further investigation and exploitation in machine solutions for speech recognition. Recently, large deep artificial neural network models pre-trained on a massive amount of unlabelled waveform features (e.g.…”
Section: Discussionsupporting
confidence: 86%
“…This has primarily been in the domain of visual object perception (e.g. [7,11,12,17,22,[27][28][29]34]), with less progress made in speech perception (though see our previous work; [56,63]).…”
Section: Putational Models In Vision and Auditionmentioning
confidence: 99%
See 3 more Smart Citations