Proceedings of the 18th ACM International Conference on Multimedia 2010
DOI: 10.1145/1873951.1874092
|View full text |Cite
|
Sign up to set email alerts
|

Automatic trailer generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 7 publications
0
15
0
Order By: Relevance
“…Recognising the sparseness of this classification feature space [1][2], we propose a novel methodology that utilises the representative power of compressive sensing [14] to facilitate the derivation of truly fused audio-visual words over which we then apply both SVM and Decision Forest [15] classification techniques. As the goal of our approach is general and unconstrained audio-visual scene classification, it is different from earlier feature concatenation techniques, such as the Audio-Video Concurrence (AVC) utilised in [7] or Affective Audio-Visual Words employed in [6]. In contrast to these and earlier feature concatenation based approaches we show that our compressive audio-visual feature representation facilitates a significant reduction in dimensionality with only marginal impact on the resulting classification performance.…”
Section: Introductionmentioning
confidence: 79%
See 4 more Smart Citations
“…Recognising the sparseness of this classification feature space [1][2], we propose a novel methodology that utilises the representative power of compressive sensing [14] to facilitate the derivation of truly fused audio-visual words over which we then apply both SVM and Decision Forest [15] classification techniques. As the goal of our approach is general and unconstrained audio-visual scene classification, it is different from earlier feature concatenation techniques, such as the Audio-Video Concurrence (AVC) utilised in [7] or Affective Audio-Visual Words employed in [6]. In contrast to these and earlier feature concatenation based approaches we show that our compressive audio-visual feature representation facilitates a significant reduction in dimensionality with only marginal impact on the resulting classification performance.…”
Section: Introductionmentioning
confidence: 79%
“…The simple concatenation of multi-modal feature representations is a commonplace [5] [6]. By contrast, here we look to the use of a compressive sensing derived methodology [14][24] as an approach for the combinatorial mapping of a multi-modal feature space into a single compressed multi-dimensional representation.…”
Section: Multi-modal Feature Representationmentioning
confidence: 99%
See 3 more Smart Citations