2024
DOI: 10.1007/s44163-024-00101-y
|View full text |Cite
|
Sign up to set email alerts
|

Wanet: weight and attention network for video summarization

Arpan Basu,
Rishav Pramanik,
Ram Sarkar

Abstract: In this paper, we propose a deep learning-based model, called Weight and Attention Network (WANet), for video summarization. The network comprises a simple multi-head attention mechanism, followed by a feed-forward network to obtain the frame importance scores. Summary keyshots are obtained from the scores using a combination of kernel temporal segmentation and the knapsack algorithm. Contrary to past methods, we first enrich the input frames with similar information as opposed to letting the model learn all t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 28 publications
0
0
0
Order By: Relevance