Proceedings of the 25th ACM International Conference on Multimedia 2017
DOI: 10.1145/3123266.3123297
|View full text |Cite
|
Sign up to set email alerts
|

Query-adaptive Video Summarization via Quality-aware Relevance Estimation

Abstract: Although the problem of automatic video summarization has recently received a lot of a ention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied. We address this problem by posing queryrelevant summarization as a video frame subset selection problem, which lets us optimise for summaries which are simultaneously diverse, representative of the entire video, and relevant to a text query. We quantify relevance by measuring the distance between fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
36
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 81 publications
(37 citation statements)
references
References 52 publications
1
36
0
Order By: Relevance
“…relationship between rankings). We take advantage of these tools in measuring the similarities between the implicit rankings provided by generated and human annotated frame level importance scores as in [20].…”
Section: Evaluation Using Rank Order Statisticsmentioning
confidence: 99%
“…relationship between rankings). We take advantage of these tools in measuring the similarities between the implicit rankings provided by generated and human annotated frame level importance scores as in [20].…”
Section: Evaluation Using Rank Order Statisticsmentioning
confidence: 99%
“…There exist supervised methods based on complex audiovisual features that can become personalized by training on annotations coming from a single user [18]. Other personalized methods use text queries [17]. They suffer, however, from the cold start problem, not being able to provide recommendations for users that are not in the training set.…”
Section: Related Workmentioning
confidence: 99%
“…The segment is thus regarded to be related to a query if it is either semantically similar to the query, or is visually similar to other segments which are related to the query. In [14], a quality-aware relevance estimation is proposed that relies on measuring the distance using the cosine similarity between embeddings of the frames and the text queries, where the textual representation is achieved by first using the word2vec model [33], and then an Long Short-Term Memory (LSTM) [34] to encode each into a single fixed-length embedding. Summaries are then obtained by selecting key frames using a linear combination of submodular objectives which jointly consider diversity, representativeness and quality of the visual features, as well as the similarity between the query and frames.…”
Section: Query-conditioned Video Summarizationmentioning
confidence: 99%
“…Different from traditional video summarization which only focuses on video content, query-conditioned video summarization is tasked to generate user-oriented summaries conditioned on a given query in the form of text [4,[12][13][14][15]. As shown in Figure 1, different user queries for the same video will have different summary results, so that the summary can be tailored to different user interests, leading to a better personalized summary.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation