“…The impressive development of deep neural networks, especially convolutional neural networks (CNNs; Garcia‐Garcia, Orts‐Escolano, Oprea, Villena‐Martinez, & Garcia‐Rodriguez), has led to significant improvements in semantic segmentation approaches in recent years. Many robotic applications have benefited from these improvements, for example, autonomous driving (Luc, Neverova, Couprie, Verbeek, & LeCun) and object detection and manipulation (Wong et al.). For training, however, these methods require large amounts of pixel‐level labeled data.…”