2022
DOI: 10.3390/app12020832
|View full text |Cite
|
Sign up to set email alerts
|

Sound Source Separation Mechanisms of Different Deep Networks Explained from the Perspective of Auditory Perception

Abstract: Thanks to the development of deep learning, various sound source separation networks have been proposed and made significant progress. However, the study on the underlying separation mechanisms is still in its infancy. In this study, deep networks are explained from the perspective of auditory perception mechanisms. For separating two arbitrary sound sources from monaural recordings, three different networks with different parameters are trained and achieve excellent performances. The networks’ output can obta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…Although these proposals are highly accurate, their application is not always feasible in a real-time scenario. This is mainly due to the computational complexity of these approaches being very high, essentially in implementing sound separation mechanisms [42,43].…”
Section: Introductionmentioning
confidence: 99%
“…Although these proposals are highly accurate, their application is not always feasible in a real-time scenario. This is mainly due to the computational complexity of these approaches being very high, essentially in implementing sound separation mechanisms [42,43].…”
Section: Introductionmentioning
confidence: 99%
“…The possible migration from conventional methods to the methods based on the ANNs in the field of microwave devices was started to be investigated based on the disadvantages, which are mentioned above [28,29]. The usage of ANNs seems promising because ANNs are already successfully used in other fields, such as image processing [30], sound processing [31], and the forecasting of financial markets [32]. The advantage of artificial neural networks (ANNs) in this case is the speed of prediction.…”
Section: Introductionmentioning
confidence: 99%
“…While the learned components of the model attempt to compensate, they are unable to do so. [48,289] recently showed that Conv-TasNet learns to separate signals based on proximity in frequency. In other words, if two sounds are close together in frequency, then Conv-TasNet is more likely to believe that they are from the same source.…”
Section: Toy Analysis Of Filterbank Correlationmentioning
confidence: 99%