2018
DOI: 10.1016/j.csl.2018.04.003
|View full text |Cite
|
Sign up to set email alerts
|

Comparing human and automatic speech recognition in simple and complex acoustic scenes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(10 citation statements)
references
References 34 publications
0
10
0
Order By: Relevance
“…With recent advances in automatic speech recognition (ASR), there has been an increased interest in comparing human speech recognition (HSR) with ASR (Rader et al, 2015;Stolcke and Droppo, 2017;Spille and Meyer, 2017;Kell et al, 2018;Spille et al, 2018;Hu et al, 2020;Kollmeier et al, 2020). Such comparisons can be used to pinpoint existing weaknesses in ASR systems, but as the gap between HSR and ASR tightens further, ASR models may also be used as a proxy for human hearing.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…With recent advances in automatic speech recognition (ASR), there has been an increased interest in comparing human speech recognition (HSR) with ASR (Rader et al, 2015;Stolcke and Droppo, 2017;Spille and Meyer, 2017;Kell et al, 2018;Spille et al, 2018;Hu et al, 2020;Kollmeier et al, 2020). Such comparisons can be used to pinpoint existing weaknesses in ASR systems, but as the gap between HSR and ASR tightens further, ASR models may also be used as a proxy for human hearing.…”
Section: Introductionmentioning
confidence: 99%
“…The impressive results of these modelling efforts (even if not universal; Jacob et al 2021) have led to the suggestion that this approach should be extended more widely in neuroscience (Kriegeskorte, 2015; Richards et al, 2019). With recent advances in deep learning-based automatic speech recognition (ASR), there has therefore been an increased interest in comparing human speech recognition (HSR) with ASR (Rader et al, 2015; Stolcke and Droppo, 2017; Spille and Meyer, 2017; Kell et al, 2018; Spille et al, 2018; Hu et al, 2020; Kollmeier et al, 2020). ASR models may provide an opportunity to support or generate new hypotheses about the functioning of HSR.…”
Section: Introductionmentioning
confidence: 99%
“…Despite that some of the latest ASR systems outperform human performance on specific tasks [14,22], ASR performance is clearly inferior to human auditory system under challenging acoustic conditions, i.e. when input speech is clipped, spectrally modulated, bandpass filtered, or masked by noise [15,23]. The present study is focused on the effect of LTR speech on E2E ASR.…”
Section: Locally-time Reversed Speechmentioning
confidence: 99%
“…D IRECTION Of Arrival (DOA) estimation and Sound Source Localization with microphone arrays has been widely investigated and used in different applications, such as robot audition [1], [2], acoustic characterization [3], speech recognition [4], [5] or teleconference systems [6]. Most of the techniques in the literature can be roughly classified into i) Time Difference Of Arrival (TDOA) based techniques, which first use the Generalized Cross-Correlation (GCCs) functions [7] to estimate the TDOA and then compute the most reliable DOA for them, ii) beamforming based techniques, such as SRP-PHAT [8], [9], which search the direction that maximizes the power of the output of a beamformer, and iii) subspace techniques, such as Multiple Signal Classification (MUSIC) [10], [11], based on the eigenstructure of the narrowband cross-correlation matrices.…”
Section: Introductionmentioning
confidence: 99%