2022
DOI: 10.1007/978-3-031-19809-0_9
|View full text |Cite
|
Sign up to set email alerts
|

Static and Dynamic Concepts for Self-supervised Video Representation Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 55 publications
0
6
0
Order By: Relevance
“…In linear probing settings, our method achieves the best results on both datasets. As the major counterparts with R(2+1)D backbone, DCLR [15] and SDC [43] also utilize frame difference as the source of motion information. FIMA outperforms DCLR and SDC in general, which implies we align the features of frame difference more precisely.…”
Section: Evaluation On Downstream Tasksmentioning
confidence: 99%
See 2 more Smart Citations
“…In linear probing settings, our method achieves the best results on both datasets. As the major counterparts with R(2+1)D backbone, DCLR [15] and SDC [43] also utilize frame difference as the source of motion information. FIMA outperforms DCLR and SDC in general, which implies we align the features of frame difference more precisely.…”
Section: Evaluation On Downstream Tasksmentioning
confidence: 99%
“…In the video domain, this learning diagram has also presented promising performance by keeping the instances within the same video semantically consistent [16,45]. However, vanilla video contrastive learning has difficulty modeling local temporal information [12,43] and possesses severe background bias [14,55], which limits the generalization and transferability of the learned representations. The reason may stem from the existence of static bias in the positive pair construction.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, following [28], when we predict the knee arthritis grades of patients by using x-ray images and a CBM, we set the concepts as clinical findings corrected by medical doctors, and thereby understand how clinical findings affect the predicted grades based on the correlation, by observing the learned weights in the last connection. Concept-based interpretation is used in knowledge discovery for chess [39], video representation [44], medical imaging [25], clinical risk prediction [45], computer aided diagnosis [27], and other healthcare domain problems [10]. CBM is a significant foundation for these applications, and advanced methods [49,44,27] have been proposed based on CBM.…”
Section: Introductionmentioning
confidence: 99%
“…Concept-based interpretation is used in knowledge discovery for chess [39], video representation [44], medical imaging [25], clinical risk prediction [45], computer aided diagnosis [27], and other healthcare domain problems [10]. CBM is a significant foundation for these applications, and advanced methods [49,44,27] have been proposed based on CBM. Hence, it is important to clarify the theoretical behavior of CBM.…”
Section: Introductionmentioning
confidence: 99%