ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9413411

Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments

Abstract: In the speaker extraction problem, it is found that additional information from the target speaker contributes to the tracking and extraction of the target speaker, which includes voiceprint, lip movement, facial expression, and spatial information. However, no one cares for the cue of sound onset, which has been emphasized in the auditory scene analysis and psychology. Inspired by it, we explicitly modeled the onset cue and verified the effectiveness in the speaker extraction task. We further extended to the …

Cited by 9 publications (1 citation statement). References 29 publications.

“…For the cocktail party effect, many effective end-to-end neural network models have been proposed (Ephrat et al., 2018; Chao et al., 2019; Hao et al., 2021; Wang et al., 2021). However, the analysis of why these networks work is very difficult since the functional structures in these black-box models are very dense without clear function diversity.…”
Section: Related Work (mentioning)
confidence: 99%