“…Specifically, the task of supervised semantic segmentation has been widely studied in a series of works like VGG [49], U-net [46], DeepLab [9] and others [50,39,63,19,40]. However, the task of unsupervised image semantic segmentation using deep learning frameworks, where no labels are available has been less researched until recently [31,42,25,10]. Thse methods mostly rely on the concept of Mutual Information Maximization (MIM) which was used for image and volume registration in classical methods [57,55], and recently was incorporated in CNNs and GNNs by [27] and [54], respectively.…”