Gated Feedback Refinement Network for Dense Image Labeling

Islam, Amirul; Rochan, Mrigank; Bruce, Neil D. B.; Wang, Yang

doi:10.1109/cvpr.2017.518

Cited by 190 publications

(149 citation statements)

References 54 publications

Supporting

Mentioning

149

Contrasting

Order By: Relevance

“…PASCAL VOC 2012 is a popular semantic segmentation dataset consisting of 1,464 images for training, 1,449 images for validation and 1,456 images for testing, which includes 20 object categories and one background class. Following prior work [7,35,20,32,7], we use the augmented training set that includes 10,582 images [14]. First, we report experimental results on the PASCAL VOC 2012 validation set.…”

Section: Results On Pascal Voc 2012 Datasetmentioning

confidence: 99%

See 1 more Smart Citation

Recurrent Iterative Gating Networks for Semantic Segmentation

Karim

Islam

Bruce

2019

2019 IEEE Winter Conference on Applications of Computer Vision (WACV)

Self Cite

View full text Add to dashboard Cite

In this paper, we present a canonical structure for controlling information flow in neural networks with an efficient feedback routing mechanism based on a strategy of Distributed Iterative Gating (DIGNet). The structure of this mechanism derives from a strong conceptual foundation, and presents a light-weight mechanism for adaptive control of computation similar to recurrent convolutional neural networks by integrating feedback signals with a feed forward architecture. In contrast to other RNN formulations, DIGNet generates feedback signals in a cascaded manner that implicitly carries information from all the layers above. This cascaded feedback propagation by means of the propagator gates is found to be more effective compared to other feedback mechanisms that use feedback from output of either the corresponding stage or from the previous stage. Experiments reveal the high degree of capability that this recurrent approach with cascaded feedback presents over feed-forward baselines and other recurrent models for pixel-wise labeling problems on three challenging datasets, PASCAL VOC 2012, COCO-Stuff, and ADE20K.

show abstract

Section: Results On Pascal Voc 2012 Datasetmentioning

confidence: 99%

“…There are a few specific considerations that motivate this paper, which presents a simple lightweight gating mechanism [42,20,28] that is top down wherein larger convolutional windows and more discriminative features play a role in guiding feedforward activation among earlier fea-…”

Section: Introductionmentioning

confidence: 99%

Recurrent Iterative Gating Networks for Semantic Segmentation

Karim

Islam

Bruce

2019

2019 IEEE Winter Conference on Applications of Computer Vision (WACV)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Through adding skip connections, U-Net [25] designs an elegant symmetric network architecture, which stacks convolutional features from the encoder to the decoder activations. More recently, more attention have been paid to RefineNets [9,12,26,27], which adopt ResNet [2] in encoder-decoder structure, and have been demonstrated very effective on several semantic segmentation benchmarks [20,28].…”

Section: Related Workmentioning

confidence: 99%

ESNet: An Efficient Symmetric Network for Real-Time Semantic Segmentation

Wang¹,

Zhou²,

Xiong³

et al. 2019

Lecture Notes in Computer Science

View full text Add to dashboard Cite

The recent years have witnessed great advances for semantic segmentation using deep convolutional neural networks (DCNNs). However, a large number of convolutional layers and feature channels lead to semantic segmentation as a computationally heavy task, which is disadvantage to the scenario with limited resources. In this paper, we design an efficient symmetric network, called (ESNet), to address this problem. The whole network has nearly symmetric architecture, which is mainly composed of a series of factorized convolution unit (FCU) and its parallel counterparts. On one hand, the FCU adopts a widely-used 1D factorized convolution in residual layers. On the other hand, the parallel version employs a transform-split-transform-merge strategy in the designment of residual module, where the split branch adopts dilated convolutions with different rate to enlarge receptive field. Our model has nearly 1.6M parameters, and is able to be performed over 62 FPS on a single GTX 1080Ti GPU. The experiments demonstrate that our approach achieves state-of-the-art results in terms of speed and accuracy trade-off for realtime semantic segmentation on CityScapes dataset.

show abstract

“…The ablation experiment results are shown in Table 2. In order to further boost the gradient backpropagation and information flow, we compute multiple losses for different aggregated feature map F i motivated by (Zhao et al 2017;Islam et al 2017;Fu et al 2017). Specifically, F i is fed to upsample module to obtain a feature map L i with channel C, where C is number of classes in prediction labels.…”

Section: Boundary-aware Lossmentioning

confidence: 99%

“…And the aggregated feature map is feature maps from all the previous blocks. Thus, each feature map in the encoder has much shorter path to loss compared with previous encoder-decoder structure (Lin et al 2017a;Islam et al 2017). The gradient backpropagation and information flowing is much more efficient.…”

Section: Boundary-aware Lossmentioning

confidence: 99%

Learning Fully Dense Neural Networks for Image Semantic Segmentation

Zhen

Wang

Zhou

et al. 2019

AAAI

View full text Add to dashboard Cite

Semantic segmentation is pixel-wise classification which retains critical spatial information. The "feature map reuse" has been commonly adopted in CNN based approaches to take advantage of feature maps in the early layers for the later spatial reconstruction. Along this direction, we go a step further by proposing a fully dense neural network with an encoderdecoder structure that we abbreviate as FDNet. For each stage in the decoder module, feature maps of all the previous blocks are adaptively aggregated to feedforward as input. On the one hand, it reconstructs the spatial boundaries accurately. On the other hand, it learns more efficiently with the more efficient gradient backpropagation. In addition, we propose the boundary-aware loss function to focus more attention on the pixels near the boundary, which boosts the "hard examples" labeling. We have demonstrated the best performance of the FDNet on the two benchmark datasets: PASCAL VOC 2012, NYUDv2 over previous works when not considering training on other datasets.

show abstract

Gated Feedback Refinement Network for Dense Image Labeling

Cited by 190 publications

References 54 publications

Recurrent Iterative Gating Networks for Semantic Segmentation

Recurrent Iterative Gating Networks for Semantic Segmentation

ESNet: An Efficient Symmetric Network for Real-Time Semantic Segmentation

Learning Fully Dense Neural Networks for Image Semantic Segmentation

Contact Info

Product

Resources

About