Automatically and ideally segmenting the semantic region of each object in an image will greatly improve the precision and efficiency of subsequent image processing. We propose an automatic image segmentation algorithm based on superpixels and image-level labels. The proposed algorithm consists of three stages. At the stage of superpixel segmentation, we adaptively generate the initial number of superpixels using the minimum spatial distance and the total number of pixels in the image. At the stage of superpixel merging, we define small superpixels and directly merge the most similar superpixel pairs without considering the adjacency, until the number of superpixels equals the number of groupings contained in image-level labels. Furthermore, we add a stage of reclassification of disconnected regions after superpixel merging to enhance the connectivity of segmented regions. On the widely used Microsoft Research Cambridge data set and Berkeley segmentation data set, we demonstrate that our algorithm can produce high-precision image segmentation results compared with the state-of-the-art algorithms.
The task of semantic segmentation is to obtain strong pixel-level annotations for each pixel in the image. For fully supervised semantic segmentation, the task is achieved by a segmentation model trained using pixel-level annotations. However, the pixel-level annotation process is very expensive and time-consuming. To reduce the cost, the paper proposes a semantic candidate regions trained extreme learning machine (ELM) method with image-level labels to achieve pixel-level labels mapping. In this work, the paper casts the pixel mapping problem into a candidate region semantic inference problem. Specifically, after segmenting each image into a set of superpixels, superpixels are automatically combined to achieve segmentation of candidate region according to the number of image-level labels. Semantic inference of candidate regions is realized based on the relationship and neighborhood rough set associated with semantic labels. Finally, the paper trains the ELM using the candidate regions of the inferred labels to classify the test candidate regions. The experiment is verified on the MSRC dataset and PASCAL VOC 2012, which are popularly used in semantic segmentation. The experimental results show that the proposed method outperforms several state-of-the-art approaches for deep semantic segmentation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.