20th International Conference on Content-Based Multimedia Indexing 2023
DOI: 10.1145/3617233.3617260

Video Memorability Prediction From Jointly-learnt Semantic and Visual Features

Iván Martín-Fernández,
Ricardo Kleinlein,
Cristina Luna-Jiménez
et al.

Abstract: The memorability of a video is defined as an intrinsic property of its visual features that dictates the fraction of people who recall having watched it on a second viewing within a memory game. Still, unravelling what are the key features to predict memorability remains an obscure matter. This challenge is addressed here by fine-tuning text and image encoders using a cross-modal strategy known as Contrastive Language-Image Pre-training (CLIP). The resulting video-level data representations learned include sem…

Cited by 1 publication
References 22 publications