2021
DOI: 10.1002/cta.2924
|View full text |Cite
|
Sign up to set email alerts
|

A novel FCNs‐ConvLSTM network for video salient object detection

Abstract: Summary A video saliency detection model is proposed based on deep learning, which improves the existing fully convolutional network (FCN)‐based model by introducing a convolutional long short‐term memory (ConvLSTM) module. The ConvLSTM splits the input into two flows with two layers in each one. The two flows have different dilation rates that make them have different receptive fields, which enables the proposed model to perform better in depicting the contour of objects. The ConvLSTM module receive frames in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…The difference between ConvLSTM and LSTM is that ConvLSTM takes in 3D data instead of 1D data as its input and produce 3D output vectors. ConvLSTM has been applied in change detection of hyperspectral images [24], video salient object detection [25], and Controllable Space-Time Video Super-Resolution [26]. Both LRCN and ConvLSTM are deeply explain in Section 2.5.…”
Section: Introductionmentioning
confidence: 99%
“…The difference between ConvLSTM and LSTM is that ConvLSTM takes in 3D data instead of 1D data as its input and produce 3D output vectors. ConvLSTM has been applied in change detection of hyperspectral images [24], video salient object detection [25], and Controllable Space-Time Video Super-Resolution [26]. Both LRCN and ConvLSTM are deeply explain in Section 2.5.…”
Section: Introductionmentioning
confidence: 99%
“…Meanwhile, to improve the accuracy and robustness of power line detection, some technologies for image augmentation are also applied in enhancing the original images and reduce the disturbance of background noise as well [10]. As a matter of fact, whether the final detection performances of the above methods are useful mainly depends on the previous steps [12]. For example, the line detection and the selected edge candidates are easily interfered by the background and blurred cables of some images, which may affect the results to some extent [13].…”
Section: Introductionmentioning
confidence: 99%
“…The video salient object detection (VSOD), also known as zero-shot video segmentation [1], [2], [3], [4], [5], [6], has received extensive research attention in recent years, whose primary objective is to segment video objects that attract the human visual attention most [7], [8], [9]. Different from the widely studied image salient object detection (ISOD) using spatial information only [10], [11], [12], the temporal information provided by the video data makes the saliency detection task more difficult [13], [14], [15], and we give an in-depth discussion regarding this issue to clearly demonstrate our motivation.…”
Section: Introductionmentioning
confidence: 99%