“…[175]
MS-TransUNet++ [178] | 2022 | MRI, CT | prostate, liver | liver tumor [107], prostate cancer [179]
DSTUNet [180] | 2022 | MRI, CT | abdominal, left ventricle, right ventricle, myocardium | cardiac disease [113], colorectal cancer, ventral hernia [138], cardiac disease [181]
SegTransVAE [182] | 2022 | CT, MRI | kidney, brain | kidney tumor [108], brain tumor [133]
MT-UNet [ [113], [185], [186]
ViTBIS [187] | 2021 | CT, MRI | abdomen, brain | brain tumor [123,122], colorectal cancer, ventral hernia [138]
O-Net [88] | 2022 | dermoscopic, CT | skin, abdomen | melanoma [89], colorectal cancer, ventral hernia [138]

… connection part is a CNN-transformer-based encoder consisting of several convolutional multi-head attention blocks and multi-head attention blocks. The connection part takes the output of the encoder at different levels, and its outputs are fed to the decoder.…”
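The data flow described in the excerpt (an encoder producing features at several levels, a connection part that processes each level, and a decoder that consumes the results) can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the architecture of any model in the table: the excerpt does not specify the block internals, so single-head self-attention with identity projections stands in for the attention blocks, pair-averaging stands in for a strided convolution, and token repetition stands in for learned upsampling. All function names (`self_attention`, `downsample`, `upsample`, `encoder`, `connection`, `decoder`) are hypothetical.

```python
import math


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity query/key/value projections, used here as a
    stand-in for the (convolutional) multi-head attention blocks.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([sum(w / total * v[i] for w, v in zip(weights, tokens))
                    for i in range(d)])
    return out


def downsample(tokens):
    """Average adjacent token pairs (stand-in for a strided convolution)."""
    return [[(a + b) / 2 for a, b in zip(tokens[i], tokens[i + 1])]
            for i in range(0, len(tokens) - 1, 2)]


def upsample(tokens):
    """Repeat every token twice (stand-in for learned upsampling)."""
    return [list(t) for t in tokens for _ in range(2)]


def encoder(tokens, levels=3):
    """Return one feature sequence per level, from finest to coarsest."""
    feats = []
    for level in range(levels):
        tokens = self_attention(tokens)
        feats.append(tokens)
        if level < levels - 1:
            tokens = downsample(tokens)
    return feats


def connection(feats):
    """Process the encoder output of every level before it reaches the decoder."""
    return [self_attention(f) for f in feats]


def decoder(connected):
    """Start from the coarsest level and merge finer levels by addition."""
    x = connected[-1]
    for skip in reversed(connected[:-1]):
        x = upsample(x)
        x = [[a + b for a, b in zip(u, s)] for u, s in zip(x, skip)]
    return x


if __name__ == "__main__":
    tokens = [[float(i), 1.0] for i in range(8)]  # 8 tokens, 2 channels
    feats = encoder(tokens)
    out = decoder(connection(feats))
    print([len(f) for f in feats], len(out), len(out[0]))
```

In this sketch the connection part sees every encoder level rather than only the deepest one, which is the property the excerpt emphasises; how the levels are actually fused in the decoder is an assumption (element-wise addition here).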