“…Moreover, using transformers has been shown to be more promising in computer vision (Dosovitskiy et al, 2020; for utilizing long-range dependencies than other, traditional CNN-based methods. In parallel, transformers with powerful global relation modeling abilities have become the standard starting point for training on a wide range of downstream medical imaging analysis tasks, such as image segmentation Cao et al, 2021;Wang et al, 2021b;Valanarasu et al, 2021;Xie et al, 2021b), image synthesis (Kong et al, 2021;Ristea et al, 2021;Dalmaz et al, 2021), and image enhancement (Korkmaz et al, 2021;Luthra et al, 2021;Wang et al, 2021a).…”