2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2016
DOI: 10.1109/globalsip.2016.7906042
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic scene classification using convolutional neural networks

Abstract: The task of classifying videos of natural dynamic scenes into appropriate classes has gained lot of attention in recent years. The problem especially becomes challenging when the camera used to capture the video is dynamic. In this paper, we analyse the performance of statistical aggregation (SA) techniques on various pre-trained convolutional neural network(CNN) models to address this problem.The proposed approach works by extracting CNN activation features for a number of frames in a video and then uses an a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(2 citation statements)
references
References 26 publications
0
2
0
Order By: Relevance
“…These available methods make an implicit assumption that the signal is present in vectorized form and convert a multi-dimensional signal into a single dimension before testing. Many real-world signals such as dynamic scene video [11] or tomographic images are inherently multi dimensional, which capture the spatial and temporal correlations within the data. However, by vectorizing the signal we lose the multi-dimensional structure of the data, which could be used to enhance the performance of the detector.…”
Section: Introductionmentioning
confidence: 99%
“…These available methods make an implicit assumption that the signal is present in vectorized form and convert a multi-dimensional signal into a single dimension before testing. Many real-world signals such as dynamic scene video [11] or tomographic images are inherently multi dimensional, which capture the spatial and temporal correlations within the data. However, by vectorizing the signal we lose the multi-dimensional structure of the data, which could be used to enhance the performance of the detector.…”
Section: Introductionmentioning
confidence: 99%
“…In the robotics field, feature extraction systems based on CNN models have been mainly applied for object recognition [42][43][44][45][46][47][48] and scene classification [51][52][53][54]. Concerning the object recognition task, recent advances have integrated object detection solutions by means of bounding box regression and object classification capabilities within the same CNN model [42][43][44].…”
Section: With Image Sensorsmentioning
confidence: 99%