2020
DOI: 10.1155/2020/8841681
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency

Abstract: Visual saliency prediction for RGB-D images is more challenging than that for their RGB counterparts. Additionally, very few investigations have been undertaken concerning RGB-D-saliency prediction. The proposed study presents a method based on a hierarchical multimodal adaptive fusion (HMAF) network to facilitate end-to-end prediction of RGB-D saliency. In the proposed method, hierarchical (multilevel) multimodal features are first extracted from an RGB image and depth map using a VGG-16-based two-stream netw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 39 publications
0
2
0
Order By: Relevance
“…The distances that lie vertically, horizontally, and diagonally in the gradient vector are compared with a constant threshold. A distance greater than the threshold is set to the white pixel value of 1, and distances less than the threshold are set to the black pixel value 0 and, as a result, a binary image is formed of outer and internal boundaries of the hand [54]. The resulting directional image is shown in Figure 4.…”
Section: Directional Imagesmentioning
confidence: 99%
See 1 more Smart Citation
“…The distances that lie vertically, horizontally, and diagonally in the gradient vector are compared with a constant threshold. A distance greater than the threshold is set to the white pixel value of 1, and distances less than the threshold are set to the black pixel value 0 and, as a result, a binary image is formed of outer and internal boundaries of the hand [54]. The resulting directional image is shown in Figure 4.…”
Section: Directional Imagesmentioning
confidence: 99%
“…The segmented hand is then used for landmark detection. Many approaches are proposed to localize hand landmarks, which help in feature extraction for distinguishing and determining specific gestures [54][55][56][57][58]. The majority of techniques are quite simple and limit the exact localization of landmarks.…”
Section: Landmark Detectionmentioning
confidence: 99%