2021
DOI: 10.1109/access.2021.3109904
|View full text |Cite
|
Sign up to set email alerts
|

Deep Edge Computing for Videos

Abstract: This paper provides a modular architecture with deep neural networks as a solution for realtime video analytics in an edge-computing environment. The modular architecture consists of two networks of Front-CNN (Convolutional Neural Network) and Back-CNN, where we adopt Shallow 3D CNN (S3D) as the Front-CNN and a pre-trained 2D CNN as the Back-CNN. The S3D (i.e., the Front CNN) is in charge of condensing a sequence of video frames into a feature map with three channels. That is, the S3D takes a set of sequential… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 37 publications
0
2
0
Order By: Relevance
“…In the temporal stream, T frames, each having dimensions of W × H × C, are selected by uniform sampling. Then, these T frames are condensed into a single frame of W × H × C by the S3D (Shallow 3D CNN) motion module [27]. Notably, the S3D module does not utilize fixed weights; instead, its weights are initialized and updated during training, enabling a more adaptive and robust representation of motion features.…”
Section: Data Processingmentioning
confidence: 99%
“…In the temporal stream, T frames, each having dimensions of W × H × C, are selected by uniform sampling. Then, these T frames are condensed into a single frame of W × H × C by the S3D (Shallow 3D CNN) motion module [27]. Notably, the S3D module does not utilize fixed weights; instead, its weights are initialized and updated during training, enabling a more adaptive and robust representation of motion features.…”
Section: Data Processingmentioning
confidence: 99%
“…The aim is to utilize available resources by task partitioning and pre-processing the video data using a DNN model. [ 52 ] proposes a solution for real-time videos by designing a Front-CNN consisting of a Shallow 3D CNN and pre-trained 2D CNN as the Back-CNN. This end-to-end trainable architecture is capable of learning both spatiotemporal information of videos thereby achieving state-of-the-art performance.…”
Section: Anomaly Detection At Edge Devices Using Machine Learningmentioning
confidence: 99%