2020
DOI: 10.48550/arxiv.2008.03339
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Learning Based Dereverberation of Temporal Envelopesfor Robust Speech Recognition

Anurenjan Purushothaman,
Anirudh Sreeram,
Rohit Kumar
et al.

Abstract: Automatic speech recognition in reverberant conditions is a challenging task as the long-term envelopes of the reverberant speech are temporally smeared. In this paper, we propose a neural model for enhancement of sub-band temporal envelopes for dereverberation of speech. The temporal envelopes are derived using the autoregressive modeling framework of frequency domain linear prediction (FDLP). The neural enhancement model proposed in this paper performs an envelop gain based enhancement of temporal envelopes … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
5
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 27 publications
1
5
0
Order By: Relevance
“…We use FDLP features [31] for far-field speech. This paper extends the prior work done in [32] by proposing a joint neural dereverberation which forms an elegant neural learning framework. Further, several ASR experiments with the joint modeling approach are also conducted in this work.…”
Section: Related Prior Workmentioning
confidence: 76%
“…We use FDLP features [31] for far-field speech. This paper extends the prior work done in [32] by proposing a joint neural dereverberation which forms an elegant neural learning framework. Further, several ASR experiments with the joint modeling approach are also conducted in this work.…”
Section: Related Prior Workmentioning
confidence: 76%
“…al [27], however the frequency domain loss is used for training. Previously, we had proposed a convolutional neural network model to perform dereverberation of speech [17,18]. This approach used a neural network applied on autoregressive models of sub-band envelopes to perform dereverberation.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Let x q (n), h q (n) and r q (n) denote the sub-band clean speech, room-response and the reverberant speech respectively for the qth sub-band. The sub-band envelopes of farfield speech m rq (n), extracted using frequency domain linear prediction (FDLP) [28,3], can be approximated as, [17]…”
Section: Signal Modelmentioning
confidence: 99%
See 2 more Smart Citations