2020
DOI: 10.1109/access.2020.3009987
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Identification in Multi-Talker Overlapping Speech Using Neural Networks

Abstract: Although numerous works have studied the problem of automatic speaker identification (SID), there are only few works on the SID for overlapping speech, and none of them consider the case of more than two simultaneous speakers. Recognizing that overlapping speech occurs frequently in real-life scenarios, such as in meetings or debates, this work investigates the methods for overlapping SID (OSID) that can determine identities in the overlapping speech from up to five simultaneous speakers. We propose two deep-l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…In ASR, Overlapping Speech Detection (OSD) improves transcription performance by effectively detecting and separating overlapping speech, minimizing errors introduced by speaker overlap [8], [9]. For speaker identification, OSD is critical to detect overlaps and allow precise separation, essential for accurate speaker matching [10]. In speaker diarization, OSD improves the accuracy of attributing speech segments to individual speakers by identifying instances of overlapping speech [11].…”
Section: Introductionmentioning
confidence: 99%
“…In ASR, Overlapping Speech Detection (OSD) improves transcription performance by effectively detecting and separating overlapping speech, minimizing errors introduced by speaker overlap [8], [9]. For speaker identification, OSD is critical to detect overlaps and allow precise separation, essential for accurate speaker matching [10]. In speaker diarization, OSD improves the accuracy of attributing speech segments to individual speakers by identifying instances of overlapping speech [11].…”
Section: Introductionmentioning
confidence: 99%