2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00367
|View full text |Cite
|
Sign up to set email alerts
|

S2-MLP: Spatial-Shift MLP Architecture for Vision

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
53
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 153 publications
(54 citation statements)
references
References 18 publications
0
53
0
1
Order By: Relevance
“…They have also been proved to achieve good performance on computer vision tasks. For example, a spatially shifted MLP (S 2 -MLP) (Yu et al, , 2022 takes spatial shifts in four directions and mixes them in a channel-wise manner to gather information from neighboring tokens. Similar to S 2 -MLP, an axial-shifted MLP (AS-MLP) (Lian et al, 2021) changes the spatial shifts in both the horizontal and vertical axes to gather local region information.…”
Section: Related Workmentioning
confidence: 99%
“…They have also been proved to achieve good performance on computer vision tasks. For example, a spatially shifted MLP (S 2 -MLP) (Yu et al, , 2022 takes spatial shifts in four directions and mixes them in a channel-wise manner to gather information from neighboring tokens. Similar to S 2 -MLP, an axial-shifted MLP (AS-MLP) (Lian et al, 2021) changes the spatial shifts in both the horizontal and vertical axes to gather local region information.…”
Section: Related Workmentioning
confidence: 99%
“…ViP [40] and sMLP [34] encode the feature representations along two axial dimensions to improve MLPs' efficiency and capability. Shift [41], ASMLP [29] and S 2 MLP [32] perform spatial information mixing with spatial shift operations along different dimensions. CycleMLP [31], WaveMLP [42] and MorphMLP [33] restrict the spatial information interaction within hand-craft local windows in a deterministic way.…”
Section: Mlp-like Modelsmentioning
confidence: 99%
“…Afterwards, Transformers spring up and make splendid breakthroughs on various vision tasks [13,14,15,16,17,18,19,20,21,22,23,24,25,26,27]. Most recently, the multi-layer perceptrons (MLPs) based architectures [28,29] have regained their light and been demonstrated capable of achieving stunning results on vision tasks [30,28,31,32,29,33,34]. A situation in which these three families of backbone architectures are contending has been formed.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations