2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00692
|View full text |Cite
|
Sign up to set email alerts
|

Representation Compensation Networks for Continual Semantic Segmentation

Abstract: We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation using the encoder-decoder framework and introduce SegViTv2. In our work, we implement the decoder with the global attention mechanism inherent in ViT backbones and propose the light-weight Attentionto-Mask (ATM) module that effectively converts the global attention map into semantic masks for highquality segmentation results. Our decoder can outperform the most commonly-used decoder UpperNet in various ViT backbones while c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
43
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 68 publications
(43 citation statements)
references
References 113 publications
0
43
0
Order By: Relevance
“…CSWKD [41] weights the distillation loss based on the old and new class similarity. Other than knowledge distillation, RCIL [63] designs a two-branch module to decouple the representation learning of old and new classes. In multiorgan segmentation, only one study [31] applies CSS, based…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…CSWKD [41] weights the distillation loss based on the old and new class similarity. Other than knowledge distillation, RCIL [63] designs a two-branch module to decouple the representation learning of old and new classes. In multiorgan segmentation, only one study [31] applies CSS, based…”
Section: Related Workmentioning
confidence: 99%
“…Federated learning is a related solution [43], but it may not always be viable or easily accessible considering the requirement for sophisticated and expensive software/hardware computing infrastructures. Alternatively, we achieve this clinically preferred goal via continual semantic segmentation (CSS), which is emerging very recently in the natural image domain [5,11,36,37,63] but has been only scarcely studied for medical imaging [31,39].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Other techniques including mBCE and DKD in Sec. 3.2 can be applied to the off-the-shelf models, which brings significant performance gains, compared to current CISS methods [5,9,22,29] (See Tables 1 and 3).…”
Section: Numerical Values Of Zmentioning
confidence: 99%
“…Class-incremental semantic segmentation (CISS) adopts a CIL paradigm for the task of semantic segmentation. CISS methods [5,9,21,29] typically exploit a softmax cross-entropy (CE) term along with knowledge distillation (KD) [14]. Although the CE term helps to learn novel classes, applying the softmax function to all classes, including both old and novel ones, lowers class probabilities of old ones.…”
Section: Introductionmentioning
confidence: 99%