2017
DOI: 10.1007/s11042-017-4962-9
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning for content-based video retrieval in film and television production

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(15 citation statements)
references
References 29 publications
0
15
0
Order By: Relevance
“…Remember that an AP score of 1 means that the k relevant videos for a query are returned exactly in the top k positions in the ranked video list, which is the ideal result. The ResNet-101 configuration achieves a similar number of queries (21,611) with AP higher than 90%, but also obtains a very low AP, less than 10%, for 451 queries ( $ 2% of the total), which is the main cause for the overall lower MAP. This is also evident in Fig.…”
Section: Experiments On the Test Setmentioning
confidence: 98%
See 1 more Smart Citation
“…Remember that an AP score of 1 means that the k relevant videos for a query are returned exactly in the top k positions in the ranked video list, which is the ideal result. The ResNet-101 configuration achieves a similar number of queries (21,611) with AP higher than 90%, but also obtains a very low AP, less than 10%, for 451 queries ( $ 2% of the total), which is the main cause for the overall lower MAP. This is also evident in Fig.…”
Section: Experiments On the Test Setmentioning
confidence: 98%
“…Mu ¨hling et al [21] presented a system able to perform video search based on textual descriptions or face images, in addition to face identification and clustering. The authors used Faster R-CNN [22] for face detection and another CNN [23] for feature extraction.…”
Section: Face Video Retrieval and Related Tasksmentioning
confidence: 99%
“…The approach has reached good performance in city. Mühling et al use deep learning method for video content retrieval in films and TV programs, and achieve high retrieval rate in those videos [74]. Hu et al construct a deep incremental slow feature analysis (D-IncSFA) network, to implement video anomaly detection, which relies on hand-crafted representations [75].…”
Section: Mixed-stage Video Object Detectionmentioning
confidence: 99%
“…Other works have been studding visual semantics for large scale annotation, like [32,35,24]. Most recent works approach the problem with deep learning schemes which prove great performance [22,34]. However, CBVR methods require a lot of computational resources and are sometimes not feasible for large scale and real time applications as the one targeting in this work.…”
Section: Related Workmentioning
confidence: 99%