2015
DOI: 10.1186/s13636-015-0073-6
|View full text |Cite
|
Sign up to set email alerts
|

Speech enhancement based on Bayesian decision and spectral amplitude estimation

Abstract: In this paper, a single-channel speech enhancement method based on Bayesian decision and spectral amplitude estimation is proposed, in which the speech detection module and spectral amplitude estimation module are included, and the two modules are strongly coupled. First, under the decisions of speech presence and speech absence, the optimal speech amplitude estimators are obtained by minimizing a combined Bayesian risk function, respectively. Second, using the obtained spectral amplitude estimators, the optim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…The MUSHRA test consisted of 20 units, each unit presented the labelled clean audio file as a reference, and six other audio files with hidden labels. The six hidden audio files are: 1) The original noisy signal, 2) The enhanced noisy signal produced by SE-GAN, SE-WaveNet and SE-VCAE, 3) The clean audio file (this is the hidden reference), 4) The noisy speech signal but with an SNR 5dB less than the signal being enhanced [39] (this is the anchor). For each of the 20 units, the unlabelled audio files were in a randomised order.…”
Section: Mushra Listening Test Descriptionmentioning
confidence: 99%
“…The MUSHRA test consisted of 20 units, each unit presented the labelled clean audio file as a reference, and six other audio files with hidden labels. The six hidden audio files are: 1) The original noisy signal, 2) The enhanced noisy signal produced by SE-GAN, SE-WaveNet and SE-VCAE, 3) The clean audio file (this is the hidden reference), 4) The noisy speech signal but with an SNR 5dB less than the signal being enhanced [39] (this is the anchor). For each of the 20 units, the unlabelled audio files were in a randomised order.…”
Section: Mushra Listening Test Descriptionmentioning
confidence: 99%
“…The MUSHRA listening test [77] is a commonly used method for the subjective evaluation of audio quality. It does not require a huge number of participants to obtain a statistically significant result [79,80] reference. For this reason, we have used the MUSHRA listening test to evaluate the subjective quality of the SS results of the URMP database.…”
Section: Appendix 1: Listening Testsmentioning
confidence: 99%
“…Speaker independence in ASR should only be applied if the nature of application demands it. [18] B. Proposed Solution The proposed scheme is an IoT implementation at constrained embedded level where sensor nodes are provided local IP address while major routing nodes are provided full-fledged internet protocol support which also solves the security issue.…”
Section: The I/o Interface For Actuation and Monitoring Shouldmentioning
confidence: 99%