2020
DOI: 10.3390/s20236941
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Learning to Detect Key Frames in Videos

Abstract: Detecting key frames in videos is a common problem in many applications such as video classification, action recognition and video summarization. These tasks can be performed more efficiently using only a handful of key frames rather than the full video. Existing key frame detection approaches are mostly designed for supervised learning and require manual labelling of key frames in a large corpus of training data to train the models. Labelling requires human annotators from different backgrounds to annotate ke… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(27 citation statements)
references
References 66 publications
0
27
0
Order By: Relevance
“…Because the direct comparison between key frame detection and our approach was not fair, we introduces related research in this section. In one recently reported study, Yan et al [6] provided a new method that required no annotation. Their proposed approach had a selfsupervised learning framework that could identify the key frames in a video.…”
Section: Key Frame Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Because the direct comparison between key frame detection and our approach was not fair, we introduces related research in this section. In one recently reported study, Yan et al [6] provided a new method that required no annotation. Their proposed approach had a selfsupervised learning framework that could identify the key frames in a video.…”
Section: Key Frame Detectionmentioning
confidence: 99%
“…The most related topic could be key frame detection, which aims to discover the frame that provides powerful features from a video. Yan et al [6] recently achieved key frame detection in a self-supervised manner, that is, at zero annotation cost. They verified their method by applying a thorough experiment on an action recognition dataset.…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, the work in (Breck et al, 2019) presents a schema to detect errors in data pipelines for ML solutions. However, video-based data processing pipelines pose additional challenges related to key-frame identification (Yan et al, 2020) and efficient data storage and retrieval (Kang et al, 2019). In this work, we present end-to-end ML pipelines that are capable of ingesting large volumes of video data streams, identifying the key-frames that need annotation, and creating an efficient storage-retrieval schema that aid data collection for subsequent modeling cycles.…”
Section: Related Workmentioning
confidence: 99%
“…Computer Vision and scene understanding applications rely on processing large volumes of video data to learn the patterns related to the objects/regions of interest (ROIs) [1]. Therefore, automated video sequence processing systems for efficient frame tagging, storage and retrieval become key components for building such automated machine vision systems.…”
Section: Introductionmentioning
confidence: 99%