ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053789
|View full text |Cite
|
Sign up to set email alerts
|

Controlling the Perceived Sound Quality for Dialogue Enhancement With Deep Learning

Abstract: Speech enhancement attenuates interfering sounds in speech signals but may introduce artifacts that perceivably deteriorate the output signal. We propose a method for controlling the trade-off between the attenuation of the interfering background signal and the loss of sound quality. A deep neural network estimates the attenuation of the separated background signal such that the sound quality, quantified using the Artifact-related Perceptual Score, meets an adjustable target. Subjective evaluations indicate th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…In source separation, it is often of interest to assess the interferer reduction and the presence of distortions, artifacts, and colorations independently rather than jointly [55], [75]. This can give useful diagnostic information, e.g., for supporting the interpretation of a listening test [76] or for controlling the amount of interferer reduction such that a desired artifacts-related quality level is met [77]. As far as perceptual scores assessing exclusively artifacts, only one dataset is available (PEASS APS LT), in which listeners rated the quality in terms of absence of additional artificial noise.…”
Section: Results For the Artifacts-only Scores For The Source Separat...mentioning
confidence: 99%
“…In source separation, it is often of interest to assess the interferer reduction and the presence of distortions, artifacts, and colorations independently rather than jointly [55], [75]. This can give useful diagnostic information, e.g., for supporting the interpretation of a listening test [76] or for controlling the amount of interferer reduction such that a desired artifacts-related quality level is met [77]. As far as perceptual scores assessing exclusively artifacts, only one dataset is available (PEASS APS LT), in which listeners rated the quality in terms of absence of additional artificial noise.…”
Section: Results For the Artifacts-only Scores For The Source Separat...mentioning
confidence: 99%