Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2324
|View full text |Cite
|
Sign up to set email alerts
|

The ACLEW DiViMe: An Easy-to-use Diarization Tool

Abstract: We present "DiViMe", an open-source virtual machine aimed at packaging speech technology for real-life data, and developed in the context of the "Analyzing Children's Language Environments across the World" Project. This first release focuses on Speech Activity Detection, Speaker Diarization, and their evaluation. The present paper introduces the set of included tools and the current workflow, which is focused on making minimal assumptions regarding users' technical skills. Additionally, we show how the curren… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1
1
1

Relationship

6
3

Authors

Journals

citations
Cited by 28 publications
(29 citation statements)
references
References 7 publications
0
29
0
Order By: Relevance
“…Indeed, the two systems included in DiViMe performed near the bottom in a recent challenge called DiHARD, aimed at assessing diarization performance in "difficult" datasets, such as meetings and doctor-patient interviews [33]. But it is likely that the present recordings are considerably more difficult even than their "difficult" datasets, since (according to [16]) the DiViMe pipelines applied to the DiHARD standardized evaluation set led to DERs of 65-72%. The difference between these DiHARD scores in the 70's and the DERs averaging 110% obtained for the Tsimane dataset represents the additional difficulty posed by these data.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Indeed, the two systems included in DiViMe performed near the bottom in a recent challenge called DiHARD, aimed at assessing diarization performance in "difficult" datasets, such as meetings and doctor-patient interviews [33]. But it is likely that the present recordings are considerably more difficult even than their "difficult" datasets, since (according to [16]) the DiViMe pipelines applied to the DiHARD standardized evaluation set led to DERs of 65-72%. The difference between these DiHARD scores in the 70's and the DERs averaging 110% obtained for the Tsimane dataset represents the additional difficulty posed by these data.…”
Section: Resultsmentioning
confidence: 99%
“…The Diarization Virtual MachinE (DiViMe for short) currently contains two tools that permit speech activity detection (i.e., detecting which portions of the recording contain some speech), and one tool for speaker diarization (i.e., attributing a speech portion to one or another speaker), which, combined, lead to two purely unsupervised pipelines yielding a segmentation of the recording into different speakers. In a recently accepted paper [16], global speech activity detection and talker diarization performance was reported.…”
Section: Diarization In Maximally Ecological Recordings: Data From Tsmentioning
confidence: 99%
“…However, now that the basic concept and its functionality have been validated, the next efforts should be directed toward the development and testing of a robust speaker diarization module required for speaker attribution. Although the current ACLEW virtual machine published in Le Franc et al (2018) already contains one such a tool, DiarTK (Vijayasenan & Valente, 2012), its performance was found to be lacking on child daylong data (see also DiHARD diarization challenge 10 where DiarTK scored at the bottom among all the submissions; see also Le Franc et al, 2018). In order to maintain focus on SAD and syllabifier comparisons, no separate experiments with diarization tools were included in the present report.…”
Section: Limitations and Future Workmentioning
confidence: 99%
“…Although the there is no real open-source, languagegeneral, and population-general version of LENA, there is a relatively easy-to-use, open alternative being developed: DiViMe (Le Franc et al, 2018;ACLEW/DiViMe, 2018).…”
Section: Available Alternative Systems To Lenamentioning
confidence: 99%