2021
DOI: 10.1007/978-3-030-87202-1_41
|View full text |Cite
|
Sign up to set email alerts
|

CataNet: Predicting Remaining Cataract Surgery Duration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 21 publications
0
6
0
Order By: Relevance
“…Still however, sequences of only 10 frames are trained end to end. CataNet [38] proposes a complex, 4-stage learning process for ResNet-LSTMs to predict surgery duration. In the first two stages, ResNet and LSTM are trained separately, followed by an end-to-end stage and repeated finetuning of the LSTM.…”
Section: Surgical Workflow Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…Still however, sequences of only 10 frames are trained end to end. CataNet [38] proposes a complex, 4-stage learning process for ResNet-LSTMs to predict surgery duration. In the first two stages, ResNet and LSTM are trained separately, followed by an end-to-end stage and repeated finetuning of the LSTM.…”
Section: Surgical Workflow Analysismentioning
confidence: 99%
“…Nwoye et al [39] justify their 2-stage approach through "fair comparison". And the authors of CataNet [38] deactivate BN in their public code repository 3 but do not discuss this in the paper. Rivoir et al [41] briefly mention BatchNorm's "cheating" to justify their choice of an AlexNet backbone for instrument anticipation.…”
Section: Surgical Workflow Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…To help train future surgeons and optimize surgical workflows, automated methods that analyze cataract surgery videos have gained significant traction in the last decade. With the prospect of reducing intra-operative and post-operative complications [5], recent methods have included surgical skill assessment [8,26], remaining surgical time estimation [13], irregularity detection [7] or relevance-based compression [6]. In addition, a reliable relevant-instance-segmentation approach is often a prerequisite for a majority of these applications [17].…”
Section: Introductionmentioning
confidence: 99%
“…Currently, various works have investigated anticipation of surgical workflow [2]- [8]. Most works are solely based on pixel-level visual features extracted by ResNet [9] and similar backbones, and learn these features directly with temporal models [10].…”
Section: Introductionmentioning
confidence: 99%