2018
DOI: 10.1007/978-3-030-01258-8_45
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Semantic Scene Completion Network with Spatial Group Convolution

Abstract: We introduce Spatial Group Convolution (SGC) for accelerating the computation of 3D dense prediction tasks. SGC is orthogonal to group convolution, which works on spatial dimensions rather than feature channel dimension. It divides input voxels into different groups, then conducts 3D sparse convolution on these separated groups. As only valid voxels are considered when performing convolution, computation can be significantly reduced with a slight loss of accuracy. The proposed operations are validated on seman… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
81
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 102 publications
(81 citation statements)
references
References 44 publications
0
81
0
Order By: Relevance
“…Spatial Group Convolution To improve the computing efficiency of the 3D network. EsscNet [42] is introduced, rather than to conduct the group convolution on feature channel dimension, which adopts the group convolution on the spatial aspect. The drawback of spatial group convolution is that it splits the features manually into separate parts, which cause the performance drops.…”
Section: Computation-efficient Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Spatial Group Convolution To improve the computing efficiency of the 3D network. EsscNet [42] is introduced, rather than to conduct the group convolution on feature channel dimension, which adopts the group convolution on the spatial aspect. The drawback of spatial group convolution is that it splits the features manually into separate parts, which cause the performance drops.…”
Section: Computation-efficient Networkmentioning
confidence: 99%
“…However, the performance of both scene completion and semantic scene completion is around 6% higher than that of SSCNet. Compared with the Essc-Net [42], depth solely is used as the input for a fair comparison, our method is computationally cheaper than EsscNet with 6% reduction in FLOPS and increased performance. For SC and SSC tasks, EsscNet reaches the accuracies of 56.2% (SC) and 26.7% (SSC), and we achieve 59.0% (SC) and 28.9% (SSC).…”
Section: Quantitative Analysismentioning
confidence: 99%
“…Recently several methods have been proposed for SSC using deep learning techniques [3], [12], [13], [8]. Among them, the most representative work is the SSCNet [3] which conducts the semantic labeling and scene completion simultaneously and also proves that these two tasks can benefit from each other.…”
Section: A Semantic Scene Completionmentioning
confidence: 99%
“…Although better results have been achieved compared with the previous methods, SSCNet ignores the finegrained information of depth. Zhang et al [13] introduces spatial group convolution (SGC) to reduce the computation costs but with poor performance than SSCNet [3]. SEGCloud [9] employs fine-grained 3D point as input but the computing and memory costs are incredibly high.…”
Section: A Semantic Scene Completionmentioning
confidence: 99%
“…On NYU Kinect, the proposed approaches perform worse than the baseline [5] for semantic scene completion, but better for scene completion. The only approach that fairly outperforms the baseline is [23]. All the other approaches use either an additional modality (RGB images) [8], pretrain on SUNCG [5], or do both [9].…”
Section: Evaluation On Nyu Depth V2mentioning
confidence: 99%