Interspeech 2021
DOI: 10.21437/interspeech.2021-1208

The Third DIHARD Diarization Challenge

Abstract: This paper introduces the third DIHARD challenge, the third in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variation in recording equipment, noise conditions, and conversational domain. Speaker diarization is evaluated under two segmentation conditions (diarization from a reference speech segmentation vs. diarization from scratch) and 11 diverse domains. The domains span a range of recording conditions and interaction types, including read audiobooks,…

Citations: cited by 60 publications (41 citation statements)
References: 37 publications
“…DIVE establishes a new state-of-the-art on the standard CALLHOME benchmark, with 6.7% DER compared to 7.8% for the best alternative. In the future, we aim to address experimental settings with variable number of speakers and noisier acoustic conditions [38], [39].…”
Section: Discussion (citation type: mentioning)
confidence: 99%
“…For diarization, our experimental setup is based on the baseline system created by the organizers (Ryant et al 2021). We have used the toolkit 4 with the same frame-level acoustic features, embedding extractor, scoring method, etc.…”
Section: Methods (citation type: mentioning)
confidence: 99%
“…In this work, we are concerned with only identifying the various domains of spoken documents and hence have only considered the Task 1 of DIHARD III (Ryant et al 2020) where the reference SAD was given. The diarization baseline provided with this challenge (Ryant et al 2021), which was based on one of the submissions of the predecessor challenge (Singh et al 2019), was used to benchmark our proposed SD system.…”
Section: Baseline Diarization System (citation type: mentioning)
confidence: 99%
“…Speaker diarization in the multi-party scenario is still a challenging task [1][2][3]. Diarization systems are subject to severe performance degradation when several speakers are overlapping, which may naturally occur in spontaneous speech.…”
Section: Introduction (citation type: mentioning)
confidence: 99%