2020
DOI: 10.48550/arxiv.2002.02921
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Temporal Segmentation of Surgical Sub-tasks through Deep Learning with Multiple Data Sources

Abstract: Many tasks in robot-assisted surgeries (RAS) can be represented by finite-state machines (FSMs), where each state represents either an action (such as picking up a needle) or an observation (such as bleeding). A crucial step towards the automation of such surgical tasks is the temporal perception of the current surgical scene, which requires a real-time estimation of the states in the FSMs. The objective of this work is to estimate the current state of the surgical task based on the actions performed or events… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(10 citation statements)
references
References 28 publications
0
10
0
Order By: Relevance
“…Surgical task such as suturing can be practically modeled as a Finite State Machine (FSM), with a list of discrete states (actions and non-actions) and possible transitions between states [17]. Classically, the task is formulated as a MM and the transition probability matrix is learned from data [8,12,[18][19][20].…”
Section: Endoscopic Visionmentioning
confidence: 99%
See 4 more Smart Citations
“…Surgical task such as suturing can be practically modeled as a Finite State Machine (FSM), with a list of discrete states (actions and non-actions) and possible transitions between states [17]. Classically, the task is formulated as a MM and the transition probability matrix is learned from data [8,12,[18][19][20].…”
Section: Endoscopic Visionmentioning
confidence: 99%
“…This is especially useful in the prediction of surgical states, since different sources of input data have their respective strengths and weaknesses in representing states with various kinematics and visual features. Previously, we have proposed a unified model for surgical state estimation -Fusion-KVE -that incorporated multiple types of input data and exceeded the state-of-the-art state estimation performance [17]. Building on this, we explored the task of concurrent instrument path and surgical state predictions with multiple data streams and the incorporation of historic state transition sequences.…”
Section: Endoscopic Visionmentioning
confidence: 99%
See 3 more Smart Citations