Interspeech 2017 2017
DOI: 10.21437/interspeech.2017-1732
|View full text |Cite
|
Sign up to set email alerts
|

A Semi-Supervised Learning Approach for Acoustic-Prosodic Personality Perception in Under-Resourced Domains

Abstract: Automatic personality analysis has gained attention in the last years as a fundamental dimension in human-to-human and human-to-machine interaction. However, it still suffers from limited number and size of speech corpora for specific domains, such as the assessment of children's personality. This paper investigates a semi-supervised training approach to tackle this scenario. We devise an experimental setup with age and language mismatch and two training sets: a small labeled training set from the Interspeech … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…An SVM was used as the classifier for all five feature types. [23] used weak supervision to iteratively refine the initial model trained on labeled data using the unlabeled data based on knowledge-based features that exploit expert knowledge of acoustic-prosodic cues. [24] employed an autoencoder to extract features and used an adaptive neuro-fuzzy inference system to model the uncertainty in speech.…”
Section: Resultsmentioning
confidence: 99%
“…An SVM was used as the classifier for all five feature types. [23] used weak supervision to iteratively refine the initial model trained on labeled data using the unlabeled data based on knowledge-based features that exploit expert knowledge of acoustic-prosodic cues. [24] employed an autoencoder to extract features and used an adaptive neuro-fuzzy inference system to model the uncertainty in speech.…”
Section: Resultsmentioning
confidence: 99%
“…The frequency-domain linear prediction and Mel Frequency Cepstral Coefficient (MFCC) features were extracted in [13]. In [14], the acoustic and duration-related features, such as silence ratio, speech duration ratio, and speech rate features, were proposed. Moreover, Guidi et al evaluated the impact of median/mean of the fundamental frequency of speech signal, the frame-toframe jitter factor, and the glottal flow spectral slope features on personality recognition [15].…”
Section: Introductionmentioning
confidence: 99%
“…Studies were not limited only to the acoustic and nonverbal features extraction. The lexical [8], [16], knowledge-based features [14], and BFI questionnaire scores [17] were added to this variety of features as well. The effort toward achieving a wider variety of features reached 6,373 statistical feature in [18].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation