2018 IEEE 7th Global Conference on Consumer Electronics (GCCE) 2018
DOI: 10.1109/gcce.2018.8574727
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of Important Scenes in Soccer Videos Based on Collaborative Use of Audio-Visual CNN Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…It is not common to extract audio features using VGG16 trained by using ImageNet. However, since the effectiveness of the audio feature extraction using a CNN model trained by using ImageNet has been reported in the method for detection of important scenes of other sports videos [27], the proposed method experimentally adopts VGG16 trained by using ImageNet. Thus, it is expected to be effective for our tasks.…”
Section: ) Audio Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…It is not common to extract audio features using VGG16 trained by using ImageNet. However, since the effectiveness of the audio feature extraction using a CNN model trained by using ImageNet has been reported in the method for detection of important scenes of other sports videos [27], the proposed method experimentally adopts VGG16 trained by using ImageNet. Thus, it is expected to be effective for our tasks.…”
Section: ) Audio Featuresmentioning
confidence: 99%
“…Comp. 12: This is a method based on [27] using a support vector machine (SVM) [39] for visual and audio features. In order to provide a fair comparison, a one-class SVM [40], which is an unsupervised method, was used instead of a general SVM for Comp.…”
Section: A Experimental Settingmentioning
confidence: 99%
“…Free kick scenes our previously reported method [35], and the main improvement in the proposed method is the introduction of whistle features. Furthermore, we introduce CM6, a method based on a CNN model that is pre-trained with VGG16 [41].…”
Section: Foul Scenesmentioning
confidence: 99%
“…By realizing this approach, various important scenes can be accurately extracted from far-view soccer videos. It should be noted that this paper is an extended version of [35].…”
Section: Introductionmentioning
confidence: 99%