2020
DOI: 10.1515/cdbme-2020-0004
|View full text |Cite
|
Sign up to set email alerts
|

Domain gap in adapting self-supervised depth estimation methods for stereo-endoscopy

Abstract: In endoscopy, depth estimation is a task that potentially helps in quantifying visual information for better scene understanding. A plethora of depth estimation algorithms have been proposed in the computer vision community. The endoscopic domain however, differs from the typical depth estimation scenario due to differences in the setup and nature of the scene. Furthermore, it is unfeasible to obtain ground truth depth information owing to an unsuitable detection range of off-the-shelf depth sensors and diffic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
4
1

Relationship

3
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…Besides, due to the nature of data scarcity and data privacy in the surgical domain, it is particularly challenging to acquire intra-operative data in comparison to other imaging modalities. Furthermore, surgical data is highly heterogeneous, due to varying lighting conditions, acquisition angles, and the number and type of objects in the scene [63]. It can indeed be observed that the detector network Det sim is able to better learn the distribution of the simulator domain than the Det or network learns the distribution of the intraoperative domain (mean PPV +14.41, mean TPR +27.27, mean F 1 +0.2450; cf.…”
Section: Discussionmentioning
confidence: 98%
“…Besides, due to the nature of data scarcity and data privacy in the surgical domain, it is particularly challenging to acquire intra-operative data in comparison to other imaging modalities. Furthermore, surgical data is highly heterogeneous, due to varying lighting conditions, acquisition angles, and the number and type of objects in the scene [63]. It can indeed be observed that the detector network Det sim is able to better learn the distribution of the simulator domain than the Det or network learns the distribution of the intraoperative domain (mean PPV +14.41, mean TPR +27.27, mean F 1 +0.2450; cf.…”
Section: Discussionmentioning
confidence: 98%
“…However, this architecture is in disuse for depth learning since GANs produce depth maps that prioritize realism over accuracy. Self-supervised learning is a natural choice for real medical endoscopies to overcome the lack of depth labels on the target domain [37]- [39]. Although depth or stereo sensors are not common for in-vivo procedures, several works are trained with real stereo endoscopies [40]- [42].…”
Section: B Single-view Depth Learningmentioning
confidence: 99%
“…Deep learning has shown impressive results in complex computer vision tasks such as segmentation, depth perception, and pose estimation [7,26,30]. These approaches work well on feature rich datasets like road scenes but perform poorly for environments such as medical endoscopy as shown in [24]. This is because of poor texture information and the lack of photometric constancy between frames in endoscopy due to the joint motion between the camera and light source [14].…”
Section: Related Workmentioning
confidence: 99%