2022
DOI: 10.48550/arxiv.2203.17068
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers

Abstract: In this paper, we present a novel framework that jointly performs speaker diarization, speech separation, and speaker counting. Our proposed method combines end-to-end speaker diarization and speech separation methods, namely, End-to-End Neural Speaker Diarization with Encoder-Decoder-based Attractor calculation (EEND-EDA) and the Convolutional Timedomain Audio Separation Network (ConvTasNet) as multitasking joint model. We also propose the multiple 1×1 convolutional layer architecture for estimating the separ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 32 publications
(47 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?