2008
DOI: 10.1109/icassp.2008.4518676
|View full text |Cite
|
Sign up to set email alerts
|

Distant talking robust speech recognition using late reflection components of room impulse response

Abstract: We propose a robust and fast d巴reverberation technique for real-time speech recognition application. First, we effectively identifシthe late reflection components of the room impulse response. We use this information together with the con cept of Spectral Subtraction (SS) to remove the late refl ection components of the reverberant signal. In the absence of the c]ean speech in actual scenario, approximation is carried out in estimating the late reflection where the estimation e汀or is corrected through multi-ban… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

1
38
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 27 publications
(39 citation statements)
references
References 6 publications
1
38
0
Order By: Relevance
“…Speech enhancement is employed to minimize the degradation of speech and improve ASR performance. We have proposed a dereverberation approach based on multi-band Spectral Subtraction (SS) [1][2] [3]. This method employs SS similar to that steered by multi-step linear prediction [4] by removing only the late reflection components of the reverberant speech signal.…”
Section: Introductionmentioning
confidence: 99%
“…Speech enhancement is employed to minimize the degradation of speech and improve ASR performance. We have proposed a dereverberation approach based on multi-band Spectral Subtraction (SS) [1][2] [3]. This method employs SS similar to that steered by multi-step linear prediction [4] by removing only the late reflection components of the reverberant speech signal.…”
Section: Introductionmentioning
confidence: 99%
“…Nakatani et al proposed a high-performance method of blind dereverberation based on ShortTime Fourier Transformation (STFT) representation [1]. Gomez et al applied fast spectral subtraction for late reverberation by using a pre-recorded impulse response [2]. However, these and other familiar methods have not dealt with the echo-cancellation problem, or used a priori knowledge about the environment, such as room impulse response.…”
Section: Introductionmentioning
confidence: 99%
“…We adopted multi-channel semi-blind independent component analysis (MCSB-ICA) [1], because: 1) it is theoretically robust against Gaussian noise, such as that from fans, 2) it can theoretically deal with separation of the known speech, user's speech, and other sound sources, including their reverberations. Other methods have not dealt with known-source signals [2], [3], [4], user's speech signals [5], or have not been able to deal with reverberation [6], [7]. The requirements for MCSB-ICA to achieve robot audition are: a) fast convergence speed for estimating the separation filter of source signals, and b) low computational cost.…”
Section: Introductionmentioning
confidence: 99%