InfoSeg: Unsupervised Semantic Image Segmentation with Mutual Information Maximization

Harb, Robert; Knöbelreiter, Patrick

doi:10.1007/978-3-030-92659-5_2

Cited by 18 publications

(7 citation statements)

References 17 publications


“…Other works propose to extract information from low-level features, for instance by considering the histogram of the red-green-blue (RGB) values of the image pixels [44] and by employing a Markov random field [12] to model the semantic relations of pixels. In the context of deep learning frameworks, there has been great improvement in recent years [42,25,31,10]. The common factor of those methods is the incorporation of the concept MIM, which measures the similarity between two tensors of possibly different sizes and from different sources [57].…”

Section: Unsupervised Semantic Segmentation (mentioning)

confidence: 99%

“…We compare our SGSeg both with 'classical' and deep-learning based methods, namely, K-Means [62], Doersch [13], Isola [28], IIC [31], AC [42], InMars [41] and InfoSeg [25]. Our results are summarized in Tab.…”

Section: Unsupervised Image Segmentation (mentioning)

confidence: 99%

“…Specifically, the task of supervised semantic segmentation has been widely studied in a series of works like VGG [49], U-net [46], DeepLab [9] and others [50,39,63,19,40]. However, the task of unsupervised image semantic segmentation using deep learning frameworks, where no labels are available, has been less researched until recently [31,42,25,10]. These methods mostly rely on the concept of Mutual Information Maximization (MIM) which was used for image and volume registration in classical methods [57,55], and recently was incorporated in CNNs and GNNs by [27] and [54], respectively.…”

Section: Introduction (mentioning)

confidence: 99%


Unsupervised Image Semantic Segmentation Through Superpixels and Graph Neural Networks

2022


“…[21,23] group the similar pixel extracted from a randomly initialized CNN in both embedding and spatial space while keeping the diversity of embedding features. [16,20,28,29] maximize the mutual information between the pixel-level feature of two views from the same input image to distill the information shared across the image. PiCIE [10] disentangles features between different semantic objects by leveraging two simple rules, i.e.…”

Section: Unsupervised Segmentation (mentioning)

confidence: 99%

“…One way to tackle unsupervised image segmentation is to group low-level pixels into some semantic groups under the guidance of certain prior knowledge, i.e. the bottom-up manner [10,16,20,21,23,28,33], as shown in Figure 1. Those methods often assume pixels in the same semantic object share similar representation in the high-level semantic space.…”

Section: Introduction (mentioning)

confidence: 99%

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Yin, Wang, Wang et al. 2021

Preprint


Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their visual cues or certain predefined rules. As a result, it is difficult for these bottom-up approaches to generate fine-grained semantic segmentation when it comes to complicated scenes with multiple objects and some objects sharing similar visual appearance. In contrast, we propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios. Specifically, we first obtain rich high-level structured semantic concept information from large-scale vision data in a self-supervised learning manner, and use such information as a prior to discover potential semantic categories presented in target datasets. Secondly, the discovered high-level semantic categories are mapped to low-level pixel features by calculating the class activation map (CAM) with respect to certain discovered semantic representation. Lastly, the obtained CAMs serve as pseudo labels to train the segmentation module and produce final semantic segmentation. Experimental results on multiple semantic segmentation benchmarks show that our top-down unsupervised segmentation is robust to both object-centric and scene-centric datasets under different semantic granularity levels, and outperforms all the current state-of-the-art bottom-up methods. Our code is available at https://github.com/damo-cv/TransFGU .


TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation

Yin, Wang, Wang et al. 2022

Lecture Notes in Computer Science
