2015
DOI: 10.1109/tmm.2015.2477058
|View full text |Cite
|
Sign up to set email alerts
|

Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO

Abstract: Abstract-Heterogeneous feature representations are widely used in machine learning and pattern recognition, especially for multimedia analysis. The multi-modal, often also highdimensional, features may contain redundant and irrelevant information that can deteriorate the performance of modeling in classification. It is a challenging problem to select the informative features for a given task from the redundant and heterogeneous feature groups. In this paper, we propose a novel framework to address this problem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
45
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 112 publications
(46 citation statements)
references
References 44 publications
1
45
0
Order By: Relevance
“…The HGR maximal correlation is originally defined on two random variables. In contrast to reconstruction models (Srivastava and Salakhutdinov 2012; Zhao, Hu, and Wang 2015), the multi-modal extension for correlation based models is not straightforward. New modalities will bring additional whitening constraints, and the computational complexities scales up.…”
Section: Extension To More or Missing Modalitiesmentioning
confidence: 99%
“…The HGR maximal correlation is originally defined on two random variables. In contrast to reconstruction models (Srivastava and Salakhutdinov 2012; Zhao, Hu, and Wang 2015), the multi-modal extension for correlation based models is not straightforward. New modalities will bring additional whitening constraints, and the computational complexities scales up.…”
Section: Extension To More or Missing Modalitiesmentioning
confidence: 99%
“…In recent times, Deep Learning techniques are widely being used to solve many problems of computer vision (e.g., [11][12] [13] [14] [15]). Although Deep learning is preferred to Sparse Representations (SR) to improve retrieval accuracy in content-based image retrieval (CBIR) problems [16][17] [18] [19], these approaches can also be combined for the same purpose [20][21] [22] [23]. This was the main motivation of the current research.…”
Section: Introductionmentioning
confidence: 99%
“…For CBIR, the CNN can be used as a feature extractor, and the resultant features applied to present the image contents. Although Deep learning is preferred over SR to improve retrieval accuracy in CBIR problems [33][34] [35] [36], these algorithms have also been employed together with the same aim [37] [38][39] [40]. Therefore, this study presents an extensive number of experiments to figure out the best combination between these two leading approaches to miximise the performance of CBIR systems.…”
mentioning
confidence: 99%