2021
DOI: 10.52586/5066
|View full text |Cite
|
Sign up to set email alerts
|

Speech depression recognition based on attentional residual network

Abstract: Background: Depressive disorder is a common affective disorder, also known as depression, which is characterized by sadness, loss of interest, feelings of guilt or low self-worth and poor concentration. As speech is easy to obtain non-offensively with low-cost, many researchers explore the possibility of depression prediction through speech. Adopting speech signals to recognize depression has important practical significance. Aiming at the problem of the complex structure of the deep neural network method used… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…Ditthapron et al [ 27 ] used smartphones to passively capture changes in acoustic characteristics of spontaneous speech for continuous traumatic brain injury monitoring. Spontaneous speech can also be used for research on depression [ 53 ] and aphasia [ 28 , 54 ]. Thus, our proposed spontaneous speech-based approach has the potential to be used in other clinical populations with acquired language disorders.…”
Section: Discussionmentioning
confidence: 99%
“…Ditthapron et al [ 27 ] used smartphones to passively capture changes in acoustic characteristics of spontaneous speech for continuous traumatic brain injury monitoring. Spontaneous speech can also be used for research on depression [ 53 ] and aphasia [ 28 , 54 ]. Thus, our proposed spontaneous speech-based approach has the potential to be used in other clinical populations with acquired language disorders.…”
Section: Discussionmentioning
confidence: 99%
“…To collect sensor data with sensors by using our tool presented in [ 14 ] would have required us to be in presence, because it is only a prototype, not easy to use. For this reason, we preferred to use pictures and speech in this preliminary version, reassured that speech is largely adopted in recent study on depression, see for example ([ 24 , 30 , 37 , 44 ]).…”
Section: Discussion and Limitationsmentioning
confidence: 99%
“…BDI-II score has been adopted to label the training dataset. Also He et al [ 24 ] used BDI-II scores for verifying their depression prediction when applying attentional residual network on Videos of the AVEC2013 and AVEC2014 datasets. The model also estimated the severity of depression.…”
Section: Introductionmentioning
confidence: 99%
“…Manual features such as spectral, source, prosodic, and formant features are commonly employed when analyzing depression and suicidality ( Cummins et al, 2015 ). Moreover, these features are also regarded as inputs to deep neural networks ( Lang and Cui, 2018 ; Lu X. et al, 2021 ). Studies have shown that the advanced features generated by MFCC feeding into the Short Long-Term Memory (LSTM) can preserve information related to depression ( Rejaibi et al, 2022 ).…”
Section: Related Workmentioning
confidence: 99%