Proceedings of the 2022 International Conference on Multimedia Retrieval 2022
DOI: 10.1145/3512527.3531424
|View full text |Cite
|
Sign up to set email alerts
|

The Impact of Dataset Splits on Classification Performance in Medical Videos

Abstract: Figure 1: Full Approach. We use GoogleNet and SIFT on two different data split strategies. Due to very similar images in the extracted frames, the random shuffle k-fold cross validation produces unrealistic high results on multiple metrics. SIFT confirms this similarity. The video-based split shows what we might except in terms of real-world performance.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 4 publications
0
3
0
1
Order By: Relevance
“…Fox et. al [21] used both the KFCV and normal split techniques to divide the ITEC LapGyn4 Gynecologic Laparoscopy Image Dataset [22] and employed them to classify the images using a convolutional neural network (CNN) model and scale-invariant feature transform (SIFT) classification. Their results show better performance for the KFCV using a CNN.…”
Section: Related Workmentioning
confidence: 99%
“…Fox et. al [21] used both the KFCV and normal split techniques to divide the ITEC LapGyn4 Gynecologic Laparoscopy Image Dataset [22] and employed them to classify the images using a convolutional neural network (CNN) model and scale-invariant feature transform (SIFT) classification. Their results show better performance for the KFCV using a CNN.…”
Section: Related Workmentioning
confidence: 99%
“…To the best of our knowledge, only two works addressing the analysis of dataset splits for surgical phase or instrument recognition have been published so far. Fox and Schoeffmann [29] show that random sampling of video frames without considering patient split might result in training and test sets containing video frames that are visually similar. This significantly distorts the evaluation results on the test set and yields overly optimistic results.…”
Section: Related Workmentioning
confidence: 99%
“…In the 40/-/40 split, which is used in the studies [7,36], all surgical phases are represented in both sets. However, a closer inspection of phase transitions unveils a group of nine surgeries (10,13,19,22,23,29,32,33,38) that deviate from the standard workflow by skipping the first phase and initiating the surgery directly in the second phase (see Fig. 4A).…”
Section: /-/40 Splitmentioning
confidence: 99%
“…Um bom extrator de características visuais deve garantir repetibilidade e precisão, sendo invariante a mudanças geométricas e fotométricas, ao mesmo tempo em que deve ser capaz de distinguir objetos diferentes (ABDEL-HAKIM;FARAG, 2006;GRAUMAN;LEIBE, 2011). Como exemplo desse tipo de descritor, podemos citar o Scale-Invariant Feature Transform (SIFT) (LOWE, 2004), usado em tarefas de análise multimídia (PATEL;DABHI;PRAJAPATI, 2020;MAHMOODZADEH, 2021;FOX;SCHOEFFMANN, 2022).…”
Section: Características Visuaisunclassified