2018
DOI: 10.1007/978-3-030-01201-4_11
|View full text |Cite
|
Sign up to set email alerts
|

Temporal Coherence-based Self-supervised Learning for Laparoscopic Workflow Analysis

Abstract: In order to provide the right type of assistance at the right time, computer-assisted surgery systems need context awareness. To achieve this, methods for surgical workflow analysis are crucial. Currently, convolutional neural networks provide the best performance for video-based workflow analysis tasks. For training such networks, large amounts of annotated data are necessary. However, collecting a sufficient amount of data is often costly, time-consuming, and not always feasible. In this paper, we address th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 31 publications
(23 citation statements)
references
References 17 publications
0
23
0
Order By: Relevance
“…For example, one can use the recurrent neural network (RNN) or its variants to extract the MIS videos' long-term temporal information and model the temporal dependency between the frames to get better results. Moreover, there are limited datasets with location annotations of tools, the weakly supervised [39], [40] or self-supervised methods [41]- [43] can be used to reduce the dependency on spatially annotated data. To the best of our knowledge, the Cholec80 dataset [31] contains various real-world environments, so we will continue to label the Cholec80 dataset with the coordinates of bounding boxes of surgical tools to address the lack of public datasets and further promote the development of STD in MIS.…”
Section: Discussionmentioning
confidence: 99%
“…For example, one can use the recurrent neural network (RNN) or its variants to extract the MIS videos' long-term temporal information and model the temporal dependency between the frames to get better results. Moreover, there are limited datasets with location annotations of tools, the weakly supervised [39], [40] or self-supervised methods [41]- [43] can be used to reduce the dependency on spatially annotated data. To the best of our knowledge, the Cholec80 dataset [31] contains various real-world environments, so we will continue to label the Cholec80 dataset with the coordinates of bounding boxes of surgical tools to address the lack of public datasets and further promote the development of STD in MIS.…”
Section: Discussionmentioning
confidence: 99%
“…As previously stated, we evaluate our proposed active learning methods for surgical workflow analysis tasks (instrument presence and phase segmentation) on the Cholec80 dataset. For this, we first divide the dataset in 4 subsets of 20 videos each, as outlined in [6]. Each video was sampled at a rate of one frame per second and each frame was downsampled to a resolution of 384×216 pixels.…”
Section: Discussionmentioning
confidence: 99%
“…Long short-term memory units (LSTM), a more complex form of the RNN, can learn to strategically remember, but also forget, information from previ- ously seen inputs, while forgoing the problem of vanishing gradients common to RNNs [11]. Combining CNNs with LSTMs makes video-based workflow analysis, by using exclusively deep neural networks, possible [2,3,6,23].…”
Section: Recurrent Bayesian Networkmentioning
confidence: 99%
“…Such methods can be used for workflow analysis through automatic segmentation of procedures into phases or surgical actions. These methods rely, for example, on data from tool usage [28, 29] or robotic kinematics [30] or, by using DL, directly on camera data, such from an endoscope [20, 31, 32] or 3D camera [32]. DL methods for determining surgery duration (Fig.…”
Section: Enabling Ai-assisted Surgerymentioning
confidence: 99%