2019
DOI: 10.1109/access.2019.2949076
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Semantic Segmentation Using Spatio-Channel Dilated Convolutions

Abstract: There has been an increasing interest in reducing the computational cost to develop efficient deep convolutional neural networks (DCNN) for real-time semantic segmentation. In this paper, we introduce an efficient convolution method, Spatio-Channel dilated convolution (SCDC) which is composed of structured sparse kernels based on the principle of split-transform-merge. Specifically, it employs the kernels whose shapes are dilated, not only in spatial domain, but also in channel domain, using a channel sampling… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…Definition 2: mean intersection over union (MIOU) [13]; calculating the intersection and union ratio to measure the advantages and disadvantages of the algorithm, it is one of the important evaluation indexes in the semantic segmentation model. Here, the intersection and union ratio is the ratio of overlap between the standard labeling of the dataset and the predicted segmentation.…”
Section: Accuracymentioning
confidence: 99%
“…Definition 2: mean intersection over union (MIOU) [13]; calculating the intersection and union ratio to measure the advantages and disadvantages of the algorithm, it is one of the important evaluation indexes in the semantic segmentation model. Here, the intersection and union ratio is the ratio of overlap between the standard labeling of the dataset and the predicted segmentation.…”
Section: Accuracymentioning
confidence: 99%
“…It consists of a learning to down-sample module, which is a coarse global feature extractor, feature fusion module, and standard classifier. ESCNet [ 19 ] utilizes an efficient spatio-channel dilated convolution (ESC) module, which is an efficient multi-level dilated convolution module, to accomplish various receptive fields with reduced network parameters and computational complexity. EFSNet [ 42 ] propose the continuous shuffle dilated convolution (CSDC) module for less calculational effort.…”
Section: Related Workmentioning
confidence: 99%
“…Hence, to satisfy the requirements of mobile environments, light but efficient deep-neural network-based methods have been proposed, including ENet [ 15 ], ICNet [ 16 ], ESPNet [ 17 ], ERFNet [ 18 ], and ESCNet [ 19 ]. The above methods can reduce the memory and the number of complexities while presenting accurate performances.…”
Section: Introductionmentioning
confidence: 99%
“…We also adopt various experiments with different groups (G), kernel size (K) and staking times (N) to obtain a good trade-off between accuracy performance and time consumption. For a fair comparison, the light-weight segmentation network ESPNet-C [16] and the encoder of the ESCNet [37] are used as baselines to compare parameters, mIoU and inference speed obtained using the Cityscapes validation set.…”
Section: ) Encoder Network With Csdc Modulementioning
confidence: 99%
“…More specifically, EFSNet 1 achieves 54.3 % mIoU with 76.8 k parameters, which is approximately 1.0 % higher than ESPNet. Compared with ESCNet [37], EFSNet 2 achieves 57.2 % mIoU with only 154.2 k parameters with an inference speed of 237.5 FPS.…”
Section: C: Depth Factor (N)mentioning
confidence: 99%