2023
DOI: 10.1101/2023.07.20.549865
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

3 Directional Inception-ResUNet: deep spatial feature learning for multichannel singing voice separation with distortion

Abstract: Singing voice separation on robots faces the problem of interpreting ambiguous auditory signals. The acoustic signal, which the humanoid robot perceives through its onboard microphones, is a mixture of singing voice, music, and noise, with distortion, attenuation, and reverberation. In this paper, we used the 3 directional Inception-Resnet structure in the U-shaped encoding and decoding network to improve the utilization of the spatial and spectral information of the spectrograms. Multi-objectives were used to… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 42 publications
0
0
0
Order By: Relevance