2018
DOI: 10.1109/tmi.2017.2787672
|View full text |Cite
|
Sign up to set email alerts
|

Articulated Multi-Instrument 2-D Pose Estimation Using Fully Convolutional Networks

Abstract: Instrument detection, pose estimation, and tracking in surgical videos are an important vision component for computer-assisted interventions. While significant advances have been made in recent years, articulation detection is still a major challenge. In this paper, we propose a deep neural network for articulated multi-instrument 2-D pose estimation, which is trained on detailed annotations of endoscopic and microscopic data sets. Our model is formed by a fully convolutional detection-regression network. Join… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
112
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 116 publications
(115 citation statements)
references
References 30 publications
3
112
0
Order By: Relevance
“…The method used a weighted cross-entropy loss to cope with the class imbalancing problem due to the small size of the target. [22] and [11] tracked surgical instruments using a deep network having an encoder-decoder architecture. Their approaches combined instrument segmentation and detection in a multi-task learning problem to make the tool detection in a cluttered background more robust.…”
Section: Interventional / Surgical Tool Trackingmentioning
confidence: 99%
“…The method used a weighted cross-entropy loss to cope with the class imbalancing problem due to the small size of the target. [22] and [11] tracked surgical instruments using a deep network having an encoder-decoder architecture. Their approaches combined instrument segmentation and detection in a multi-task learning problem to make the tool detection in a cluttered background more robust.…”
Section: Interventional / Surgical Tool Trackingmentioning
confidence: 99%
“…There exist a number of algorithms to perform visionbased surgical tool tracking [3]. Also ConvNets have been shown to work well for this task [9,19]. In one study, Jin et al [16] use region-based ConvNets for instrument detection and localization.…”
mentioning
confidence: 99%
“…For the ablation study, inspired by [32], we compared the performance of the proposed framework with the detectiononly and regression-only architectures. Both were implemented in a spatio-temporal fashion (i.e., with 3D convolution).…”
Section: Ablation Study and Comparison With The State Of The Artmentioning
confidence: 99%