2022
DOI: 10.3390/bioengineering9120737
|View full text |Cite
|
Sign up to set email alerts
|

Surgical Gesture Recognition in Laparoscopic Tasks Based on the Transformer Network and Self-Supervised Learning

Abstract: In this study, we propose a deep learning framework and a self-supervision scheme for video-based surgical gesture recognition. The proposed framework is modular. First, a 3D convolutional network extracts feature vectors from video clips for encoding spatial and short-term temporal features. Second, the feature vectors are fed into a transformer network for capturing long-term temporal dependencies. Two main models are proposed, based on the backbone framework: C3DTrans (supervised) and SSC3DTrans (self-super… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…3DCNNs have been used successfully to learn volumetric data such as CT and MRI, and there are some reports of using 3DCNN to recognize surgical workflow, and others with some success in detecting cancer and abnormal growths in endoscopy [50][51][52][53]. However, there seems to be a paucity in the literature on applying the qualities of 3DCNN to learn spatiotemporal features together to classify between performance levels with laparoscopic surgical skills.…”
Section: Limitations and Key Findingsmentioning
confidence: 99%
“…3DCNNs have been used successfully to learn volumetric data such as CT and MRI, and there are some reports of using 3DCNN to recognize surgical workflow, and others with some success in detecting cancer and abnormal growths in endoscopy [50][51][52][53]. However, there seems to be a paucity in the literature on applying the qualities of 3DCNN to learn spatiotemporal features together to classify between performance levels with laparoscopic surgical skills.…”
Section: Limitations and Key Findingsmentioning
confidence: 99%
“…Fortunately, the technology at our disposal offers cost-effective solutions to these problems. Machine learning (ML) [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15], artificial intelligence (AI) [16][17][18][19][20][21][22][23][24][25][26][27][28][29] and Big Data [30][31][32][33][34][35] can provide robust and innovative answers to long-standing problems.…”
Section: Introductionmentioning
confidence: 99%