“…Singing voice separation (SVS) has drawn a lot of interest and consideration in many downstream applications [ 1 , 2 , 3 , 4 ]. It deals with the technique of separating a singing voice or background from a mix of music, which is a crucial strategy for singer identification [ 5 , 6 ], music information retrieval [ 7 , 8 ], lyric recognition and alignment [ 9 , 10 , 11 , 12 ], song language identification [ 13 , 14 ], and chord recognition [ 15 , 16 , 17 ]. The recent separation techniques, however, fall well short of the capabilities of human hearing.…”