2020 IEEE International Conference on Multimedia and Expo (ICME) 2020
DOI: 10.1109/icme46284.2020.9102796
|View full text |Cite
|
Sign up to set email alerts
|

Mesh Saliency Detection Using Convolutional Neural Networks

Abstract: Mesh saliency has been widely considered as the measure of visual importance of certain parts of 3D geometries, distinguishable from their surroundings, with respect to human visual perception. This work is based on the use of convolutional neural networks to extract saliency maps fo large and dense 3D scanned models. The network is trained with saliency maps constructed with a fusion spectral and geometrical analysis generated measures. Extensive evaluation studies carried out, include visual perception evalu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…CfS-CNN [24]) by 126% and 23% in terms of LCC and AUC, respectively. The quantitative results indicate that 1) 3D visual saliency that predicts human visual attention on 3D surfaces might be perceptually related to 2D image saliency and categorical information of 3D objects, and 2) our method that combines the two types Spectral Processing [8], Point Clustering [13], Salient Regions [9], Hilbert-CNN [40], RPCA [37], CfS-CNN [24], the proposed MIMO-GAN-CRF and the ground-truth fixation maps provided by the 3DVA dataset [12]. Comparative results of more objects are available in the supplementary material.…”
Section: Evaluation On the 3dva Datasetmentioning
confidence: 86%
See 2 more Smart Citations
“…CfS-CNN [24]) by 126% and 23% in terms of LCC and AUC, respectively. The quantitative results indicate that 1) 3D visual saliency that predicts human visual attention on 3D surfaces might be perceptually related to 2D image saliency and categorical information of 3D objects, and 2) our method that combines the two types Spectral Processing [8], Point Clustering [13], Salient Regions [9], Hilbert-CNN [40], RPCA [37], CfS-CNN [24], the proposed MIMO-GAN-CRF and the ground-truth fixation maps provided by the 3DVA dataset [12]. Comparative results of more objects are available in the supplementary material.…”
Section: Evaluation On the 3dva Datasetmentioning
confidence: 86%
“…The two methods avoided the training relying on vertex-level saliency annotations but were not evaluated with eye fixation ground truth. Nousias et al [40] trained a CNN to detect mesh saliency using pseudo ground truth generated by the handcrafted approach proposed in [37], which does not perform well at predicting real human fixations according to our experimental results.…”
Section: Related Workmentioning
confidence: 90%
See 1 more Smart Citation
“…consisting of the k geometrical nearest vertices of vertex v i , estimated by using the k nearest neighbors (k-nn) algorithm (typically, we set k = 25). These patches are utilized to create n matrices N i ∈ R (k+1)×3 , consisting of the k + 1 corresponding normals [26]:…”
Section: B Spectral-based Saliency Estimationmentioning
confidence: 99%
“…A recent study employs CNNs to extract saliency maps on 3D meshes utilizing a multi-view setup [33]. Nousias et al [34] employed a CNN based mesh saliency extraction approach, employing a 3D geometric patch descriptor to classify faces into four classes. Greater class indices correspond to higher saliency values.…”
Section: A Saliency Map Extractionmentioning
confidence: 99%