2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8461129
|View full text |Cite
|
Sign up to set email alerts
|

Endo-VMFuseNet: A Deep Visual-Magnetic Sensor Fusion Approach for Endoscopic Capsule Robots

Abstract: In the last decade, researchers and medical device companies have made major advances towards transforming passive capsule endoscopes into active medical robots. One of the major challenges is to endow capsule robots with accurate perception of the environment inside the human body, which will provide necessary information and enable improved medical procedures. We extend the success of deep learning approaches from various research fields to the problem of uncalibrated, asynchronous, and asymmetric sensor fus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 33 publications
0
10
0
Order By: Relevance
“…In the future, the robot speed will be increased to achieve faster motion and further reduce procedure duration. Other examples of the manipulation of magnetic endoscopes are available in the literature 42 , but they either lack localization 27 , an endoscope tether (required for interventional capabilities) 43 or an intelligent control system 44 , limiting the translation to clinical use. This Article shows a tethered magnetic endoscope successfully navigating the colon of a porcine model by means of a blended use of magnetic localization, closed-loop robotic control and elaboration of the endoscope camera image.…”
Section: Discussionmentioning
confidence: 99%
“…In the future, the robot speed will be increased to achieve faster motion and further reduce procedure duration. Other examples of the manipulation of magnetic endoscopes are available in the literature 42 , but they either lack localization 27 , an endoscope tether (required for interventional capabilities) 43 or an intelligent control system 44 , limiting the translation to clinical use. This Article shows a tethered magnetic endoscope successfully navigating the colon of a porcine model by means of a blended use of magnetic localization, closed-loop robotic control and elaboration of the endoscope camera image.…”
Section: Discussionmentioning
confidence: 99%
“…Different from supervised VO learning [2], [4], [6], where camera poses and/or depth ground truths are required to train the neural network, the core idea underlying our unsupervised pose and depth prediction method is to make use of the view synthesis constraint as the supervision metric, which forces the neural network to synthesize target image from multiple source images acquired from different camera poses. This synthesis is performed using estimated depth image, estimated target camera pose values in 6-DoF and nearby color values from source images.…”
Section: Methodsmentioning
confidence: 99%
“…We think that DL based endoscopic VO approach is more suitable for such challenge areas since the operation environment(GI tract) has similar organ tissue patterns among different patients which can be learned by a sophisticated machine learning approach easily. Even the dynamics of common artefacts such as non-rigidness, sensor occlusions, vignetting, motion blur and specularity across frame sequences could be learned and used for a better pose estimation, whereas our unsupervised odometry learning method additionally solves the common problem of missing labels on medical datasets from inner body operations [4], [6].…”
Section: Introductionmentioning
confidence: 99%
“…Above all, neither the absolute localization algorithm based on the sensor coordinate system, nor the relative localization algorithm based on the human coordinate system can provide the appropriate tracking results that can be effectively used for further endoscopy operations. Some researchers have utilized the simultaneous localization and mapping (SLAM) method to obtain position information [28,29,30,31,32,33]. Usually, vision information has been used for tracking by applying the learning methods.…”
Section: Introductionmentioning
confidence: 99%