2017
DOI: 10.1186/s13634-017-0516-6
|View full text |Cite
|
Sign up to set email alerts
|

A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation

Abstract: A reverberation-time-aware deep-neural-network (DNN)-based multi-channel speech dereverberation framework is proposed to handle a wide range of reverberation times (RT60s). There are three key steps in designing a robust system. First, to accomplish simultaneous speech dereverberation and beamforming, we propose a framework, namely DNNSpatial, by selectively concatenating log-power spectral (LPS) input features of reverberant speech from multiple microphones in an array and map them into the expected output LP… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 41 publications
0
2
0
Order By: Relevance
“…However, the DNN-based vector-to-vector regression, which is the focus in this work, mainly aims at single-channel speech enhancement and is not simply generalized to multi-channel speech enhancement. As shown in Figure 1, a traditional approach to dealing with an array of microphones is exploited spatial information at the input level by concatenating speech vectors from multiple microphones into a single high dimensional vector, e.g., [12,2]. Thus the vector-to-vector regression approach can still be employed for speech enhancement by appending multichannel feature vectors together into a high-dimensional vector and mapping it to a vector extracted from the reference vector.…”
Section: Introductionmentioning
confidence: 99%
“…However, the DNN-based vector-to-vector regression, which is the focus in this work, mainly aims at single-channel speech enhancement and is not simply generalized to multi-channel speech enhancement. As shown in Figure 1, a traditional approach to dealing with an array of microphones is exploited spatial information at the input level by concatenating speech vectors from multiple microphones into a single high dimensional vector, e.g., [12,2]. Thus the vector-to-vector regression approach can still be employed for speech enhancement by appending multichannel feature vectors together into a high-dimensional vector and mapping it to a vector extracted from the reference vector.…”
Section: Introductionmentioning
confidence: 99%
“…In this work, an improved calibration method, which takes into account the generalisation performance and robustness of geometric parameter correction, is introduced to enhance the essential positioning accuracy of robots. DNN has received substantial attention in both the signal processing field and the machine learning field with its strong regression capabilities [29][30][31]. e design of DNN architecture must be optimised to make the DNN demonstrate the best predictive capacity.…”
Section: Introductionmentioning
confidence: 99%