2013
DOI: 10.1016/j.specom.2011.12.003
|View full text |Cite
|
Sign up to set email alerts
|

Toward automating a human behavioral coding system for married couples’ interactions using speech acoustic features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
84
0
1

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 98 publications
(88 citation statements)
references
References 61 publications
3
84
0
1
Order By: Relevance
“…In comparison to the values of neutral statements, there was a significant difference in the pitch of statements classified as cues and concerns by the VR-CoDES. The statistical difference, higher f0 averages, indicates a trend consistent with the literature that individuals use an increase in the pitch of their voice when speaking with anxiety or other emotional arousal [13][14][15][16][17][18][19]. It is difficult to assign a level that would be considered a clinical difference.…”
Section: Discussionsupporting
confidence: 70%
See 1 more Smart Citation
“…In comparison to the values of neutral statements, there was a significant difference in the pitch of statements classified as cues and concerns by the VR-CoDES. The statistical difference, higher f0 averages, indicates a trend consistent with the literature that individuals use an increase in the pitch of their voice when speaking with anxiety or other emotional arousal [13][14][15][16][17][18][19]. It is difficult to assign a level that would be considered a clinical difference.…”
Section: Discussionsupporting
confidence: 70%
“…Reviewing the literature on vocal characteristics of human affect, fundamental frequency of pitch (f0), has been identified as one of the most reliable tools, essential for detecting emotional arousal using the voice [13][14][15][16][17][18][19]. However, no study has yet used this objective measure of emotional distress to characterise patient concern in the clinical oncology setting.…”
Section: Fundamental Frequency Of Pitch (F0)mentioning
confidence: 99%
“…Following this, we perform the feature extraction from speech regions. In our work we employ the preprocessing steps described in [12]. In short: We employ all available interactions with a SNR above 5dB, and perform VAD and Diarization.…”
Section: Audio Preprocessingmentioning
confidence: 99%
“…Over the last few years Behavioral Signal Processing (BSP) [9,10] has examined the analysis of such complex, domain specific behaviors. Based on machine learning techniques, BSP employed lexical [11], acoustic [12], and visual [13,14] information to analyze and model multimodal human behaviors. For instance, in couples' therapy domain, Black et al [12] built an automatic human behavioral coding system for couples interaction by using acoustic features.…”
Section: Introductionmentioning
confidence: 99%
“…In particular, human speech contains rich information for effectively conveying emotions and communicating wants, needs, and desires. The richness of human speech for understanding emotions within human interactions has motivated researchers to explore the area of emotion classification based on speech Black et al (2013).…”
Section: Introductionmentioning
confidence: 99%