2022
DOI: 10.1109/tcsvt.2021.3121680
|View full text |Cite
|
Sign up to set email alerts
|

Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes

Abstract: Real-time performance is a very important trait of semantic segmentation models aiming at applications in robotics and intelligent transportation systems. Most previous work in the field involves custom convolutional encoders trained from scratch, and decoders without lateral skip-connections. However, we argue that a better speedaccuracy trade-off is achieved with i) compact encoders designed for competitive ImageNet performance and ii) lightweight decoders with lateral skip-connections. Additionally, we prop… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 43 publications
(14 citation statements)
references
References 56 publications
0
14
0
Order By: Relevance
“…Huang et al [10] proposed the criss-cross attention to capture contextual information in an efficient way. Similar to [10], Liu et al [46] and Weng et al [15] focused on the efficiency of algorithms, and developed real-time and accurate segmentation methods. To better mine global contextual dependencies, Strudel et al [11] built the Segmenter on ViT [47].…”
Section: Related Work a Rgb Semantic Segmentationmentioning
confidence: 99%
“…Huang et al [10] proposed the criss-cross attention to capture contextual information in an efficient way. Similar to [10], Liu et al [46] and Weng et al [15] focused on the efficiency of algorithms, and developed real-time and accurate segmentation methods. To better mine global contextual dependencies, Strudel et al [11] built the Segmenter on ViT [47].…”
Section: Related Work a Rgb Semantic Segmentationmentioning
confidence: 99%
“…DeepLab [21] employs dilated convolution operations to enlarge receptive fields for non-local semantical context. SFANet [4] alleviates the misalignment gap and seeks the balance between accuracy and inference speed. Tian et al [26] dealt with semantic segmentation in the framework of unsupervised domain adaptation and proposed partial domain adaptation to avoid the negative transfer problem.…”
Section: B Edge-enhanced Semantic Segmentationmentioning
confidence: 99%
“…SED can be viewed as a dual-task of semantic segmentation. Differently, semantic segmentation decomposes an image into semantic regions by categorizing each pixel according to its spatial context [4]. The obtained semantic regions can be converted to semantic edges by keeping only boundary points and their semantical categories [5].…”
Section: Introductionmentioning
confidence: 99%
“…T HE task of single image reflection removal (SIRR) is to recover a clear transmission image by removing reflection from the blended image. This task is of significant importance in computational photography, as it not only enhances image quality but also has a positive impact on downstream computer vision tasks, such as object detection [1]- [3] and semantic segmentation [4], [5]. Since the reflection removal problem is Z. Zhang, Z.…”
Section: Introductionmentioning
confidence: 99%