2013
DOI: 10.1142/s1793351x13400023
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Information Fusion of Audio Emotion Recognition Based on Kernel Entropy Component Analysis

Abstract: This paper focuses on the application of novel information theoretic tools in the area of information fusion. Feature transformation and fusion is critical for the performance of information fusion, however, the majority of the existing works depend on second order statistics, which is only optimal for Gaussian-like distribution. In this paper, the integration of information fusion techniques and kernel entropy component analysis provides a new information theoretic tool. The fusion of features is realized usi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(10 citation statements)
references
References 19 publications
0
10
0
Order By: Relevance
“…Although the initial motivation of the developed method was to improve the efficiency of nonparametric foreground segmentation strategies, it must be noted that it can be also used in many other scientific fields such as computer vision [24], [25] or audio signal processing [26], [27], where the evaluation of continuous mathematical functions constitutes a significant computational burden.…”
Section: A Contributionmentioning
confidence: 99%
“…Although the initial motivation of the developed method was to improve the efficiency of nonparametric foreground segmentation strategies, it must be noted that it can be also used in many other scientific fields such as computer vision [24], [25] or audio signal processing [26], [27], where the evaluation of continuous mathematical functions constitutes a significant computational burden.…”
Section: A Contributionmentioning
confidence: 99%
“…It integrates the information in multiple modalities and therefore is expected to perform better prediction than the case using any unimodal information [1]. Nowadays it has been applied in a broad range of applications, such as multimedia event detection [2,3], sentiment analysis [1,4], cross-modal translation [5][6][7], Visual Question Answering (VQA) [8,9], etc.…”
Section: Introductionmentioning
confidence: 99%
“…A similar technique by Zhang et al (2016) also used global covariance PCA for feature fusion in the i-vector system. Other examples can be found in the literature on the use of PCA for speech feature fusion such as (Chibelushi et al, 1997), (Lee and Narayanan, 2005) and (Xie and Guan, 2013).…”
Section: Introductionmentioning
confidence: 99%