Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval 2017
DOI: 10.1145/3078971.3079026

Visual Descriptors in Methods for Video Hyperlinking

Abstract: In this paper, we survey different state-of-the-art visual processing methods and utilize them in hyperlinking. Visual information, calculated using Feature Signatures, SIMILE descriptors and convolutional neural networks (CNN), is utilized as similarity between video frames and used to find similar faces, objects and settings. Visual concepts in frames are also automatically recognized and textual output of the recognition is combined with search based on subtitles and transcripts. All presented experiments w…

citations
Cited by 2 publications
(2 citation statements)
references
References 19 publications
“…In contrast to these use cases, in the domain of endoscopic video no such additional input modalities are available. Galuščáková et al [13] investigate visual descriptors for the task of video hyperlinking within a multi-modal approach, i.e. they use visual (feature signatures, AlexNet fc7 CNN features, concept detection, and face recognition) and text-based (subtitles and automatic transcripts) input modalities.…”
Section: Related Work (mentioning)
confidence: 99%
“…Video hyperlinking systems usually start from a set of anchors that define entry points of interest in collections of long videos and are required to provide, for each anchor, relevant targets within the collection. This task is usually implemented as a two-step process, first starting from a segmentation of the long videos into small segments, then selecting relevant segments for a given anchor [4,5]. This last step is cast as a video retrieval task relying on video segment comparison, where various multimodal solutions have been proposed.…”
Section: Introduction (mentioning)
confidence: 99%
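
The two-step process described in the statement above (segment the long videos, then select relevant segments for an anchor) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the cited papers: fixed-length segmentation, cosine similarity as the comparison measure, and the function names `segment`, `cosine` and `rank_targets`. The descriptor vectors are stand-ins for whatever per-segment representation a system uses, for example CNN features.

```python
import math

def segment(duration_s, length_s=60.0):
    """Step 1 (assumed strategy): cut a video into fixed-length (start, end) segments."""
    duration_s = float(duration_s)
    segments = []
    start = 0.0
    while start < duration_s:
        segments.append((start, min(start + length_s, duration_s)))
        start += length_s
    return segments

def cosine(a, b):
    """Cosine similarity between two equal-length descriptor vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero descriptor as dissimilar to everything
    return dot / (norm_a * norm_b)

def rank_targets(anchor_vec, segment_vecs, top_k=3):
    """Step 2: rank candidate segments by similarity to the anchor descriptor.

    segment_vecs maps a segment id to its descriptor vector.
    Returns up to top_k (segment_id, score) pairs, best first.
    """
    scored = [(cosine(anchor_vec, vec), seg_id) for seg_id, vec in segment_vecs.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # ties broken by segment id
    return [(seg_id, score) for score, seg_id in scored[:top_k]]
```

A call such as `rank_targets(anchor_vec, segment_vecs, top_k=10)` would return the ten candidate segments most similar to the anchor. A multimodal system of the kind the statements describe would presumably combine such a visual score with text-based scores from subtitles or transcripts; that fusion step is not shown here.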