2021
DOI: 10.1007/s11263-021-01515-2
|View full text |Cite
|
Sign up to set email alerts
|

BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
301
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 967 publications
(414 citation statements)
references
References 59 publications
3
301
0
2
Order By: Relevance
“…The evaluation of image semantic segmentation of the converted classes of RELLIS-3D and RUGD were done using two state-of-the-art architectures: BiSeNetV2 [16] and HRNETV2 [15]+OCR [17]. BiSeNetV2 consists of two branches: detail branch and semantic branch.…”
Section: A Segmentationmentioning
confidence: 99%
“…The evaluation of image semantic segmentation of the converted classes of RELLIS-3D and RUGD were done using two state-of-the-art architectures: BiSeNetV2 [16] and HRNETV2 [15]+OCR [17]. BiSeNetV2 consists of two branches: detail branch and semantic branch.…”
Section: A Segmentationmentioning
confidence: 99%
“…Although these deep learning architectures achieved excellent segmentation performance, the dilation convolutions and the skip connections increase the computational complexity and memory overhead, leading to slow inference speeds. To solve this issue, Yu et al [ 19 ], [ 20 ] proposed a two-pathway architecture, known as the Bilateral Segmentation Network (BiSeNet), to speed up the segmentation inference time. The first pathway has wide channels and shallow layers to capture the spatial information from the image while the second pathway provides a large receptive field to capture wider context.…”
Section: Introductionmentioning
confidence: 99%
“…These two pathways concurrently generate feature representations, which significantly increase the segmentation efficiency. As demonstrated in [ 19 ], [ 20 ], this conceptual design is significantly faster than dilation and encoder-decoder architectures.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…These architectures indeed have fewer operations, but the decoders lack the ability of feature aggregation and may easily lose the fine-grained spatial details around boundaries or small objects. To make a balance between accuracy and speed, Changqian Yu et al [20] put forward an improved version of BiSeNet [19] (dubbed BiSeNet V2). The core components include Spatial Path and Context Path to cope with the loss of spatial information and shrinkage of receptive field, respectively.…”
Section: Introductionmentioning
confidence: 99%