“…Datasets and Metrics: Following most of the prior works [3,4,5,6,7,10,11,12] we evaluate our method on the PAS-CAL VOC 2012 semantic segmentation benchmark [14]. It includes images with pixel-wise class labels, of which 1,464, 1,449, and 1,456 are used for training, validation, and test, respectively.…”