2020
DOI: 10.3390/app10072465
|View full text |Cite
|
Sign up to set email alerts
|

Improving Singing Voice Separation Using Curriculum Learning on Recurrent Neural Networks

Abstract: Single-channel singing voice separation has been considered a difficult task, as it requires predicting two different audio sources independently from mixed vocal and instrument sounds recorded by a single microphone. We propose a new singing voice separation approach based on the curriculum learning framework, in which learning is started with only easy examples and then task difficulty is gradually increased. In this study, we regard the data providing obviously dominant characteristics of a single source as… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…For singing voice separation, in [47], a curriculum learning approach was considered, where the training begins with easy examples and the difficulty is steadily increased. Three different databases were tested: MIR-1K [48], ccMixter [49], and MUSDB18 [50], with the model yielding improved performance with respect to the global normalized source distortion ratio measure.…”
Section: B Recurrent Neural Network (Rnn)mentioning
confidence: 99%
“…For singing voice separation, in [47], a curriculum learning approach was considered, where the training begins with easy examples and the difficulty is steadily increased. Three different databases were tested: MIR-1K [48], ccMixter [49], and MUSDB18 [50], with the model yielding improved performance with respect to the global normalized source distortion ratio measure.…”
Section: B Recurrent Neural Network (Rnn)mentioning
confidence: 99%
“…Kang et al [175] proposed a singing voice separation approach based on the curriculum learning framework, in which learning is started with only easy examples and then the task difficulty is gradually increased. They define easy examples as the ones in which one source is obviously dominant over the other, where the dominance factor depends on the relative intensity of vocals and instruments.…”
Section: Singing Voice Separationmentioning
confidence: 99%
“…The second approach develops supervised models based on the task optimization on a specific training dataset. Thanks to the development of deep learning, recently proposed deep source separation systems have made significant progress [10,[14][15][16], which has begun to perform at the human level for natural source separation. The second approach has attracted significant attention because of its excellent performance, and it is also used here to solve the problem of two source separation from monaural recordings.…”
Section: Introductionmentioning
confidence: 99%