2021
DOI: 10.1186/s13636-020-00191-3
|View full text |Cite
|
Sign up to set email alerts
|

Progressive loss functions for speech enhancement with deep neural networks

Abstract: The progressive paradigm is a promising strategy to optimize network performance for speech enhancement purposes. Recent works have shown different strategies to improve the accuracy of speech enhancement solutions based on this mechanism. This paper studies the progressive speech enhancement using convolutional and residual neural network architectures and explores two criteria for loss function optimization: weighted and uniform progressive. This work carries out the evaluation on simulated and real speech s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…Our model is a fully-convolutional denoising autoencoder with skip connections (Figure 1), in the style of previous effective SE models [27,30,34]. In training, we input a noisy audio waveform x ∈ R T , comprised of clean speech signal y ∈ R T and background noise n ∈ R T so that x = λy + (1 − λ)n, where λ is a parameter to control the SNR.…”
Section: Modelmentioning
confidence: 99%
“…Our model is a fully-convolutional denoising autoencoder with skip connections (Figure 1), in the style of previous effective SE models [27,30,34]. In training, we input a noisy audio waveform x ∈ R T , comprised of clean speech signal y ∈ R T and background noise n ∈ R T so that x = λy + (1 − λ)n, where λ is a parameter to control the SNR.…”
Section: Modelmentioning
confidence: 99%
“…Llombart et al [28] developed the progressive SE using convolutional and residual neural network structures. In this system, 2 different conditions were used for optimizing the loss factor such as weighted and homogeneous progressive.…”
Section: Literature Surveymentioning
confidence: 99%
“…Nowadays, Deep learning systems have achieved human-compatible success in predicting the labelsinalmostalldomains.Manyartificialintelligentsubdomaintechniquesareappliedinvariety of appliances from Malware Detection (Kumar,2020), Object Recognition (Bayraktar,2019),Ima ge Classification (Ahuja, 2020) (Rajagopal,2020), Speech Recognition (Llombart, 2021), Natural LanguageProcessing (Do,2021),MedicalScience (Esteva,2017),SatelliteApplications (Kumar,2020), toFacialRecognitionsystems (Menon,2021).Withthegrowingadoptionofdeepneuralnetworks bymanycompanies,DNNtheuseofDNNinsafety-criticalenvironmentapplicationsincluding, Drones,Robotics,VoiceRecognition,Self-drivingcarslikeUber,Apple&Samsung,Tesla (Lex,2019), Surveillancesystems (Pillai,2021),AppleSiri("Apple,"2019),AmazonAlexa(2019),Etc.…”
Section: Introductionmentioning
confidence: 99%