Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1609
|View full text |Cite
|
Sign up to set email alerts
|

INTERSPEECH 2021 Deep Noise Suppression Challenge

Abstract: The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area of noise suppression to achieve superior perceptual speech quality. We recently organized a DNS challenge special session at INTERSPEECH and ICASSP 2020. We opensourced training and test datasets for the wideband scenario. We also open-sourced a subjective evaluation framework based on ITU-T standard P.808, which was also used to evaluate participants of the challenge. Many researchers from academia and industry made signif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
55
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 116 publications
(56 citation statements)
references
References 12 publications
1
55
0
Order By: Relevance
“…The WSJ0 training speech utterances were mixed with a diverse set of noises coming from the FreeField72k dataset, which was also used in [ 25 ]. Motivated by the popularity of the freefield1010 dataset [ 31 ], which was also included in the DNS challenge [ 32 ], the FreeField72k extends its predecessor by taking a much bigger sample of the free field recordings from the source database. In the process of creation of the FreeField72k, the Freesound online database [ 33 ] was queried, and all of the recordings tagged with the “field-recording” tag were downloaded.…”
Section: Methodsmentioning
confidence: 99%
“…The WSJ0 training speech utterances were mixed with a diverse set of noises coming from the FreeField72k dataset, which was also used in [ 25 ]. Motivated by the popularity of the freefield1010 dataset [ 31 ], which was also included in the DNS challenge [ 32 ], the FreeField72k extends its predecessor by taking a much bigger sample of the free field recordings from the source database. In the process of creation of the FreeField72k, the Freesound online database [ 33 ] was queried, and all of the recordings tagged with the “field-recording” tag were downloaded.…”
Section: Methodsmentioning
confidence: 99%
“…We first take a comprehensive ablation study on the proposed model on the DNS-2020 dataset [1]. Then our model is trained, integrated with the post-processing module and evaluated with the Interspeech 2021 DNS challenge dataset (DNS-2021) [2] to show its performance on more complicated and real acoustic scenarios. We also compare other competitive models (such as PercepNet [27]) with our model on Voice Bank + DEMAND dataset [28] as these models have PESQ scores reported on this set.…”
Section: Datasetsmentioning
confidence: 99%
“…Such data-driven approaches have become the mainstream because of their strong noise reduction abilities (especially for non-stationary noise) learned from simulated cleannoisy speech pairs. The recent deep noise suppression challenge (DNS) series [1,2] have benchmarked many state-of-the-art DLbased speech enhancers, especially for real-time ones for speech communications, through subjective listening test and promising performance has been reported.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the background noise and reverberation degrade the speech and transcription quality. As these acoustic distortions can hamper communications and thus productivity, the speech enhancement (SE) field has drawn a lot of renewed attention recently [1,2].…”
Section: Introductionmentioning
confidence: 99%