2015
DOI: 10.1186/s13634-015-0245-7
|View full text |Cite
|
Sign up to set email alerts
|

Strategies for distant speech recognitionin reverberant environments

Abstract: Reverberation and noise are known to severely affect the automatic speech recognition (ASR) performance of speech recorded by distant microphones. Therefore, we must deal with reverberation if we are to realize high-performance hands-free speech recognition. In this paper, we review a recognition system that we developed at our laboratory to deal with reverberant speech. The system consists of a speech enhancement (SE) front-end that employs long-term linear prediction-based dereverberation followed by noise r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
66
0
3

Year Published

2016
2016
2020
2020

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 62 publications
(70 citation statements)
references
References 47 publications
0
66
0
3
Order By: Relevance
“…In particular, dereverberation and beamforming based on linear filtering have been shown to be very effective to improve the robustness of ASR with multi-condition DNN-AMs [12,13]. However, even after performing such filtering, certain amount of residual noise inevitably remains, and substantially limits the improvement of the ASR performance.…”
Section: Related Workmentioning
confidence: 97%
“…In particular, dereverberation and beamforming based on linear filtering have been shown to be very effective to improve the robustness of ASR with multi-condition DNN-AMs [12,13]. However, even after performing such filtering, certain amount of residual noise inevitably remains, and substantially limits the improvement of the ASR performance.…”
Section: Related Workmentioning
confidence: 97%
“…Besides iterative LP filter estimation approaches such as [6], [8], [9], [11]- [13], also adaptive approaches based on recursive least squares (RLS) [7], [10], [16], [19] as well as the Kalman filter [14], [15], [17] have been proposed in the past years. In order to reduce noise after dereverberation, multiple-output MCLP has been cascaded with MVDR beamforming in [12], [13], which was seen to be a commonly adopted approach in the 2018 CHiME-5 challenge [23]. In [22], the cascade in [12], [13] is unified.…”
Section: Introductionmentioning
confidence: 99%
“…We further augment the proposed approach by a spectral Wiener gain post-processor, which is shown to relate to the Kalman filter's posterior state estimate. In order to demonstrate the effectiveness of the ISCLP Kalman filter, we compare against two state-of-the-art approaches -first the previously mentioned alternating Kalman filters in [18], and second a MCLP+GSC Kalman filter cascade, conceptually relating to [12], [13]. As compared to these two reference algorithms, the ISCLP Kalman filter is computationally roughly M 2 times less expensive, where M denotes the number of microphones.…”
Section: Introductionmentioning
confidence: 99%
“…One issue of the conventional cascade approach [12,13] was that the overall optimality was not guaranteed. The approach performs the optimization separately for dereverberation and beamforming.…”
Section: Introductionmentioning
confidence: 99%