2017
DOI: 10.1007/s00521-016-2824-4
|View full text |Cite
|
Sign up to set email alerts
|

Cross-model convolutional neural network for multiple modality data representation

Abstract: A novel data representation method of convolutional neural network (CNN) is proposed in this paper to represent data of different modalities. We learn a CNN model for the data of each modality to map the data of different modalities to a common space, and regularize the new representations in the common space by a cross-model relevance matrix. We further impose that the class label of data points can also be predicted from the CNN representations in the common space. The learning problem is modeled as a minimi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 38 publications
0
3
0
Order By: Relevance
“…10,14 As a result, the hash algorithm has emerged in the field of cross-modal recovery and has been widely considered by academia and industry for large-scale multimodal data retrieval. [27][28][29][30] Early CMH techniques were employed to acquire cross-modal similarity searches in which a mutual Hamming space is converted to by multimodal information/data and aligning heterogeneous feature spaces. [31][32][33][34][35] Cross-modal similarity sensitive hashing (CMSSH) 37 and cross-view hashing (CVH) 36 are early representatives of this type of method.…”
Section: Cmh Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…10,14 As a result, the hash algorithm has emerged in the field of cross-modal recovery and has been widely considered by academia and industry for large-scale multimodal data retrieval. [27][28][29][30] Early CMH techniques were employed to acquire cross-modal similarity searches in which a mutual Hamming space is converted to by multimodal information/data and aligning heterogeneous feature spaces. [31][32][33][34][35] Cross-modal similarity sensitive hashing (CMSSH) 37 and cross-view hashing (CVH) 36 are early representatives of this type of method.…”
Section: Cmh Methodsmentioning
confidence: 99%
“…There has been observed a significant increment in mapping cross‐modal information to a mutual subspace because of the sudden development of high‐latitude data 10,14 . As a result, the hash algorithm has emerged in the field of cross‐modal recovery and has been widely considered by academia and industry for large‐scale multimodal data retrieval 27–30 . Early CMH techniques were employed to acquire cross‐modal similarity searches in which a mutual Hamming space is converted to by multimodal information/data and aligning heterogeneous feature spaces 31–35 .…”
Section: Related Workmentioning
confidence: 99%
“…Its advantages are unified framework and easy implementation; its disadvantages are that compatibility and accuracy are difficult to choose from. Wu et al [30] proposed a new convolutional neural network data representation method for representing different forms of data. And learn the CNN model of each modal data, map different modal data to a public space, and regularize the new representation in the public space through a cross-model correlation matrix.…”
Section: Image-text Retrieval Model Based On Subspace Learning Methodsmentioning
confidence: 99%