2017
DOI: 10.1121/1.4977749
|View full text |Cite
|
Sign up to set email alerts
|

A transfer learning framework for predicting the emotional content of generalized sound events

Abstract: Predicting the emotions evoked by generalized sound events is a relatively recent research domain which still needs attention. In this work a framework aiming to reveal potential similarities existing during the perception of emotions evoked by sound events and songs is presented. To this end the following are proposed: (a) the usage of temporal modulation features, (b) a transfer learning module based on an echo state network, and (c) a k-medoids clustering algorithm predicting valence and arousal measurement… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 32 publications
0
10
0
Order By: Relevance
“…Such a jointly created space may offer improved prediction in multiple applications domains. To this end, we intent to exploit transfer learning technologies [12] forming a synergistic framework able to incorporate and transfer knowledge coming from multiple domains favoring diverse applications, such as music information retrieval, bioacoustic signal processing, etc.…”
Section: Resultsmentioning
confidence: 99%
“…Such a jointly created space may offer improved prediction in multiple applications domains. To this end, we intent to exploit transfer learning technologies [12] forming a synergistic framework able to incorporate and transfer knowledge coming from multiple domains favoring diverse applications, such as music information retrieval, bioacoustic signal processing, etc.…”
Section: Resultsmentioning
confidence: 99%
“…Identification of the emotion on the listener's side may be indicative of the respective human reaction. To this end, appropriate signal processing and pattern recognition algorithms can by employed [19].…”
Section: Future Perspectives: Machine Learning For Mir Purposesmentioning
confidence: 99%
“…Following the recent findings in the specific field, we used acoustic parameters able to capture characteristics associated with the context, i.e. mel-frequency cepstral coefficients and temporal modulation [17,18,19,20] modeled by means of both generative (having at their core the hidden Markov model technology) and discriminative (support vector machines [21] and echo state networks [22]) pattern recognition algorithms. Unlike deep learning techniques, which are typically characterized by low interpretability levels [23], we aimed at a comprehensive classification scheme potentially revealing useful information about the problem at hand.…”
Section: Introductionmentioning
confidence: 99%