2022
DOI: 10.1109/tnsre.2022.3143117
|View full text |Cite
|
Sign up to set email alerts
|

The Reproducibility of Bio-Acoustic Features is Associated With Sample Duration, Speech Task, and Gender

Abstract: Bio-acoustic properties of speech show evolving value in analyzing psychiatric illnesses. Obtaining a sufficient speech sample length to quantify these properties is essential, but the impact of sample duration on the stability of bio-acoustic features has not been systematically explored. We aimed to evaluate bio-acoustic features' reproducibility against changes in speech durations and tasks. We extracted source, spectral, formant, and prosodic features in 185 English-speaking adults (98 w, 87 m) for reading… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 59 publications
0
3
0
Order By: Relevance
“…Scherer et al showed through research that the accuracy of disturbance measurement in the voice was affected by the duration of the voice, and only voice over 3 s could provide accurate features ( 44 ). There are also studies proving that the pitch measurement of long voice is more accurate than that of short voice ( 45 , 46 ). To conclude, in this study, we observed that patients with different emotional states exhibited varying voice durations, indicating that the information carried in their voices differs.…”
Section: Discussionmentioning
confidence: 99%
“…Scherer et al showed through research that the accuracy of disturbance measurement in the voice was affected by the duration of the voice, and only voice over 3 s could provide accurate features ( 44 ). There are also studies proving that the pitch measurement of long voice is more accurate than that of short voice ( 45 , 46 ). To conclude, in this study, we observed that patients with different emotional states exhibited varying voice durations, indicating that the information carried in their voices differs.…”
Section: Discussionmentioning
confidence: 99%
“…[ 36 ] A recent study by Almaghrabi et al investigated the reproducibility of bioacoustic features, showing that the stability of such features was easily affected by sample duration, speech task, and gender. [ 37 ] Despite the widespread interest, studies on reliability of automated acoustic measures still remain scarce, especially since most previous research has focused on voice disorders or psychiatric illnesses, and there has been no research on the reliability of acoustic tasks and features specific to dysarthria. To our knowledge, the present study is the first attempt to explore the reproducibility of automated acoustic measures in dysarthria using AI.…”
Section: Discussionmentioning
confidence: 99%
“…Artificial intelligence (AI) or machine learning (ML) models that can detect depression severity from speech could be a more practical alternative to increased clinician interviews (Almaghrabi et al, 2023). The pace of AI innovation in processing audiovisual features has led to many proof-of-concept models for mental health detection (Le Glaz et al, 2021) and depression, specifically (Almaghrabi et al, 2023;Cummins et al, 2015;He et al, 2022;Low et al, 2020). These models leverage well-documented changes in speech, affect, and presentation as depression severity increases (Kliper et al, 2016;Zhang et al, 2020).…”
mentioning
confidence: 99%