2019
DOI: 10.1109/tcds.2018.2866587
|View full text |Cite
|
Sign up to set email alerts
|

Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(21 citation statements)
references
References 35 publications
0
21
0
Order By: Relevance
“…However, as shown in Figure 3b–f, the direct method may deform the object’s original ratio and geometric structure, which will influence the recognition performance. So, we used the scaling processing method proposed in [33]. At first, we resized the origin image so that the length of its long side becomes 227 pixels.…”
Section: Proposed Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, as shown in Figure 3b–f, the direct method may deform the object’s original ratio and geometric structure, which will influence the recognition performance. So, we used the scaling processing method proposed in [33]. At first, we resized the origin image so that the length of its long side becomes 227 pixels.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…The three streams include surface normal, color jet, and RGB channel. Tang et al proposed a canonical correlation analysis (CCA) based multi-view convolutional neural networks for RGB-D object recognition, which can effectively identify the associations between different perspectives of the same shaped model [33]. Zia et al proposed a hybrid 2D/3D convolutional neural network for RGB-D object recognition, which can be initialized with pretrained 2D CNN and can be trained over a relatively small RGB-D dataset [34].…”
Section: Related Workmentioning
confidence: 99%
“…They used a fusion saliency map of objects and a centered darker channel for object segmentation, multiple feature descriptors, feature matching, and Hough voting for the recognition of multiple objects over the RGB-D dataset. L. Tang et al [20] designed a convolution neural network framework based on canonical correlation analysis (CCA). They fused separately processed RGB and depth images through a CCA layer and a combining layer was introduced to the multi-view CNN.…”
Section: Sustainable Multi-objects Recognition Via Depth Imagesmentioning
confidence: 99%
“…The model contains three parts: deep networks (input layer, hidden layers, and output layer), feature concatenation and softmax classifier. The concatenation occurs at a higher layer instead of the input layer since concatenation at the input layer often causes 1) intractable training effort; 2) over-fitting due to prematurely learned features from both modalities; and 3) failure to learn implicit associations between modalities with different underlying features [48]. This model first learns the two modalities separately with two different flows, and then concatenate their features at a higher layer.…”
Section: The Multi-modal Model With Simple Concatenationmentioning
confidence: 99%