2017 IEEE Winter Conference on Applications of Computer Vision (WACV) 2017
DOI: 10.1109/wacv.2017.11
|View full text |Cite
|
Sign up to set email alerts
|

Recurrent Fully Convolutional Networks for Video Segmentation

Abstract: Image segmentation is an important step in most visual tasks. While convolutional neural networks have shown to perform well on single image segmentation, to our knowledge, no study has been done on leveraging recurrent gated architectures for video segmentation. Accordingly, we propose and implement a novel method for online segmentation of video sequences that incorporates temporal data. The network is built from a fully convolutional network and a recurrent unit that works on a sliding window over the tempo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
52
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 69 publications
(53 citation statements)
references
References 30 publications
1
52
0
Order By: Relevance
“…Until now, only very few approaches combining semantic segmentation approaches with recurrent structures have been proposed. One of the first approaches was the Recurrent Fully Convolutional Network (RFCN, [22]), which was introduced by Valipour et al in 2017. They extended the FCN approach [18] by placing a recurrent unit between the encoder and the decoder, and exhibit better performance on the SegTrack, Davis, and Moving MNIST dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Until now, only very few approaches combining semantic segmentation approaches with recurrent structures have been proposed. One of the first approaches was the Recurrent Fully Convolutional Network (RFCN, [22]), which was introduced by Valipour et al in 2017. They extended the FCN approach [18] by placing a recurrent unit between the encoder and the decoder, and exhibit better performance on the SegTrack, Davis, and Moving MNIST dataset.…”
Section: Related Workmentioning
confidence: 99%
“…In the literature, there are only a few approaches using recurrent neural networks (RNNs) for video segmentation, since the video sequences are usually split into its individual frames, which are processed independently of each other by state-of-the-art segmentation approaches such as ICNet [24], PSPNet [25] or Deeplab [2]. However, temporal image information is not considered in these works, but which can improve the segmentation accuracy further, as Valipour et al show in [22]. The authors showed on several datasets that the performance of the Fully Convolutional Network (FCN) [16] can be increased if a recurrent unit is placed between the encoder and the decoder.…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning has achieved state-of-the-art segmentation performance in 2D natural images [3] and 3D medical images [7,9,8]. To leverage the temporal dependency and account for segmentation continuity, recurrent neural networks (RNNs) have been adopted for videos [12,16] and 2D+T cardiac MRI datasets [18]. Convolutional models can also represent temporal relationships and offer competitive performance for language translation [5].…”
Section: Related Workmentioning
confidence: 99%