2023
DOI: 10.1109/access.2023.3330160
|View full text |Cite
|
Sign up to set email alerts
|

Blind Source Separation Based on Improved Wave-U-Net Network

Chaofeng Lan,
Jingjuan Jiang,
Lei Zhang
et al.

Abstract: With the development and widespread application of voice interaction technology, it has become crucial to improve the accuracy of blind source separation technology. In order to further enhance the separation results of vocal and accompaniment, this paper proposes an improved Wave-U-Net model. Based on the skip connection of the Wave-U-Net model, we propose a segmented attention module (SAM) consisting of a spatial attention module (SPAM) and a channel attention module (CAM) to replace the skip connections in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…Through mathematical analysis and experimental validation, it demonstrates how frame-level W-disjoint orthogonality correlates with the algorithm's performance, asserting that IVA can achieve optimal separation under specific sparsity conditions, thus providing insights for enhancing BSS strategies. Lan et al [36] introduces improvements to the Wave-U-Net model for vocal and accompaniment separation using an attention module and a spatial pyramid pooling layer. These enhancements aim to bridge semantic gaps and expand the receptive field, respectively.…”
Section: Related Workmentioning
confidence: 99%
“…Through mathematical analysis and experimental validation, it demonstrates how frame-level W-disjoint orthogonality correlates with the algorithm's performance, asserting that IVA can achieve optimal separation under specific sparsity conditions, thus providing insights for enhancing BSS strategies. Lan et al [36] introduces improvements to the Wave-U-Net model for vocal and accompaniment separation using an attention module and a spatial pyramid pooling layer. These enhancements aim to bridge semantic gaps and expand the receptive field, respectively.…”
Section: Related Workmentioning
confidence: 99%