2021
DOI: 10.1167/jov.21.7.13
|View full text |Cite
|
Sign up to set email alerts
|

Binocular vision supports the development of scene segmentation capabilities: Evidence from a deep learning model

Abstract: The application of deep learning techniques has led to substantial progress in solving a number of critical problems in machine vision, including fundamental problems of scene segmentation and depth estimation. Here, we report a novel deep neural network model, capable of simultaneous scene segmentation and depth estimation from a pair of binocular images. By manipulating the arrangement of binocular image pairs, presenting the model with standard left-right image pairs, identical image pairs or swapped left-r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 104 publications
(107 reference statements)
0
3
0
Order By: Relevance
“…Therefore, the lightweight DL model with high efficiency is also one of the future research directions. DL model for non‐2D images. Most DL models are still based on 2D images, but the binocular vision 200 and depth camera 201 provide technical support for 3D information acquisition. The acquired 3D information can help researchers to further study the behaviour of aquatic animals such as feeding behaviour, illness behaviour 179 and so on.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, the lightweight DL model with high efficiency is also one of the future research directions. DL model for non‐2D images. Most DL models are still based on 2D images, but the binocular vision 200 and depth camera 201 provide technical support for 3D information acquisition. The acquired 3D information can help researchers to further study the behaviour of aquatic animals such as feeding behaviour, illness behaviour 179 and so on.…”
Section: Discussionmentioning
confidence: 99%
“…Existing research work shows that the spatial resolution of the camera arrays can be improved by deep learning or image fusion with the help of the relationship between the images captured by different cameras [ 10 , 11 ]. Image fusion combines two or more images of the same object into a single image that is more easily interpreted than any of the originals [ 12 ].…”
Section: Introductionmentioning
confidence: 99%
“…There was a position gap between the pixels of the left and right images, which is a disparity. By calculating the disparity between corresponding points in the image, three-dimensional information about the object could be obtained using the triangulation principle [30]. The model of using stereo vision to calculate the threedimensional coordinates of target feature points is shown in Figure 5.…”
Section: Binocular Stereo Vision Measurement Algorithm 41 Three-dimen...mentioning
confidence: 99%