2021
DOI: 10.48550/arxiv.2107.07503
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech

Abstract: Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it was recorded in the same room as a reference recording, with applications both in audio postproduction and augmented reality. In this work, we propose FiNS, a Filtered Noise Shaping network that directly estimates the time domain room impulse response (RIR) from reverberant speech. Our domain-inspired architecture features a time domain encoder and a filtered noise shaping decoder that models the RIR as a sum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 25 publications
0
1
0
Order By: Relevance
“…To solve the estimation problem of transferplausible rendering, we propose a method for blind DRIR estimation based on beamforming and deconvolution, or rather adaptive system identification. This straightforward, interpretable approach is an alternative to algorithms for transferring acoustics from one signal to another that are entirely based on deep neural networks [2,3].…”
Section: Introductionmentioning
confidence: 99%
“…To solve the estimation problem of transferplausible rendering, we propose a method for blind DRIR estimation based on beamforming and deconvolution, or rather adaptive system identification. This straightforward, interpretable approach is an alternative to algorithms for transferring acoustics from one signal to another that are entirely based on deep neural networks [2,3].…”
Section: Introductionmentioning
confidence: 99%