2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) 2016
DOI: 10.1109/sibgrapi.2016.043
|View full text |Cite
|
Sign up to set email alerts
|

Bag of Genres for Video Retrieval

Abstract: This paper presents a higher level representation for videos aiming at video genre retrieval. In video genre retrieval, there is a challenge that videos may comprise multiple categories, for instance, news videos may be composed of sports, documentary, and action. Therefore, it is interesting to encode the distribution of such genres in a compact and effective manner. We propose to create a visual dictionary using a genre classifier. Each visual word in the proposed model corresponds to a region in the classif… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…8, where the regions are classified into two groups: the regions with animation (Anim) and the regions without animation (Real). In the Anim group there are eight regions labeled by [1,2,5,6,9,10,13,14], where the rest of them are in the Real group ([3, 4, 7, 8, 11, 12, 15, 16]). Fig.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…8, where the regions are classified into two groups: the regions with animation (Anim) and the regions without animation (Real). In the Anim group there are eight regions labeled by [1,2,5,6,9,10,13,14], where the rest of them are in the Real group ([3, 4, 7, 8, 11, 12, 15, 16]). Fig.…”
Section: Resultsmentioning
confidence: 99%
“…The visual information should be analyzed from various standpoints. Both static and dynamic visual information can be exploited using: time dependence [5]- [6], objects [7], motion [8], color, texture and similar descriptors [9]. The compressed domain may also be useful for such a task, as in [10] where the MPEG coefficients were applied.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, for the given example, the authority score A s ( q , 4) is equal to 14/4 2 . The time complexity for the computation of an authority score in Algorithm 1 (lines [11][12][13][14][15][16] is O ( k max 3 ), since the membership test in line 13 also needs to iterate through every object in N (q, k ) . On the other hand, the memoized version of the authority score computation given by Algorithm 3 is only O ( k max 2 ).…”
Section: Memoization Of Authority Scoresmentioning
confidence: 99%
“…The first approaches were mostly based on low-level features, such as color, shape, and texture [24,39] . Motion patterns [1] and spatio-temporal properties [13] are some examples of other features used for encoding video content. Recently, mid-and high-level features based on local descriptors [3] and deep learning [46] have emerged as promising alternative solutions.…”
Section: Introductionmentioning
confidence: 99%