ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9747055
|View full text |Cite
|
Sign up to set email alerts
|

Deepfilternet: A Low Complexity Speech Enhancement Framework for Full-Band Audio Based On Deep Filtering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(10 citation statements)
references
References 18 publications
0
10
0
Order By: Relevance
“…For full-band speech, the number of the input features increases markedly when features are extracted on a linear frequency scale, resulting in much greater computational complexity. The number of input features can be reduced by extracting them on a logarithmic frequency scale, or by using ERB-based or Bark-based filter banks to extract input features (Schröter et al, 2022).…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…For full-band speech, the number of the input features increases markedly when features are extracted on a linear frequency scale, resulting in much greater computational complexity. The number of input features can be reduced by extracting them on a logarithmic frequency scale, or by using ERB-based or Bark-based filter banks to extract input features (Schröter et al, 2022).…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…where For efficient DNN operation, BRnet is based on DeepFilterNet [27], which performs a two-stage denoising process with reference to human auditory properties. The BRnet, which consists of three modules, the Encoder, the ERB Decoder and the Deep Filter (DF) Decoder, is shown in Fig.…”
Section: A the Score Modulementioning
confidence: 99%
“…where q is the look-ahead of the deep filter. In contrast to [27], the proposed BRnet operates at 16 kHz and uses an ERB filterbank with 32 bands. The deep filter with N = 5 is applied only to the lowest 160 frequency bins which covers a 5 kHz bandwidth.…”
Section: A the Score Modulementioning
confidence: 99%
See 1 more Smart Citation
“…These filter banks retain the frequency domain information that is more important to human perception, effectively reducing the dimensionality of the input features and thus the complexity of the neural network model. DeepFilterNet [8], based on PercepNet, first enhances the spectral envelope using ERB-scaled gains and further enhances the periodic part of the preliminarily enhanced spectrum using a DeepFilter [9]. However, compact feature-based work typically uses expert hand-designed filters to derive compact real-valued features, which do not make use of phase information.…”
Section: Introductionmentioning
confidence: 99%