2012
DOI: 10.1007/978-3-642-34584-5_3

Ten Recent Trends in Computational Paralinguistics

Abstract: The field of computational paralinguistics is currently emerging from loosely connected research in speech analysis, including speaker classification and emotion recognition. Starting from a broad perspective on the state-of-the-art in this field, we combine these facts with a bit of 'tea leaf reading' to identify ten trends that might characterise the next decade of research: taking into account more tasks and task interdependencies, modelling paralinguistic information in the continuous domain, agglomerating…

Cited by 10 publications (8 citation statements)
References: 92 publications
“…In our experiments, we used such evaluation metrics as the per class Accuracy, Precision, Recall, and F1-score. Due to the unequal number of samples in each test class (unequal priors), we have analyzed the results using Unweighted Average Recall (UAR) for multiclass classifiers, closely related to the accuracy as a good or even better metric to optimize when the sample class ratio is imbalanced [72]. UAR is defined as the average across the diagonal of the confusion matrix.…”
Section: Evaluation Setup (mentioning)
confidence: 99%
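
The UAR definition quoted above can be illustrated with a minimal sketch. It assumes the usual reading of that definition: the diagonal is taken from the row-normalised confusion matrix, so each diagonal entry is one class's recall and UAR is their plain mean. The function names and the toy labels are hypothetical and do not come from the cited work.

```python
from collections import Counter


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recall; each class counts equally, whatever its size."""
    support = Counter(y_true)  # number of true samples per class
    # Correct predictions per class (missing classes default to 0 in a Counter).
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / support[c] for c in support]
    return sum(recalls) / len(recalls)


def accuracy(y_true, y_pred):
    """Fraction of all samples predicted correctly (dominated by large classes)."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical imbalanced test set: 8 "neutral" samples and 2 "angry" samples.
y_true = ["neutral"] * 8 + ["angry"] * 2
# A degenerate classifier that always predicts the majority class.
y_pred = ["neutral"] * 10

print(accuracy(y_true, y_pred))                   # 0.8
print(unweighted_average_recall(y_true, y_pred))  # 0.5
```

In this toy case the majority-class predictor reaches 0.8 accuracy but only 0.5 UAR, which is chance level for two classes. That gap is why UAR is preferred when class priors are unequal.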
“…The test set for automatic evaluation consists of 33 separate samples of acting emotional speech of Russian children used in perception tests [21]. Both classifiers were trained based on the eGeMAPS feature set [72].…”
Section: Comparison Of the Subjective Evaluation And Automatic Emotio... (mentioning)
confidence: 99%
“…In such cases the quantization into a few categorical labels might lead to a loss in model representativeness [7]. In comparison with the categorical problem, only a few publications have addressed the dimensional recognition challenges, yet it has become a trend in the affective computing community [7], [40], [46], [47], [48], [49]. Some works approximated dimensional affect indicators with fine-grained quantization scales on segmented data, as in [42].…”
Section: Related Work (mentioning)
confidence: 99%
“…There is an increasing amount of research in that field [1][2][3] [4] and a number of Interspeech challenges in recent years have been organized with the intention to foster research in the many different aspects of paralanguage and to combine the sometimes scattered research efforts leveraging synergy effects [5].…”
Section: Introduction (mentioning)
confidence: 99%