“…The fundamental computer vision task of semantic segmentation seeks to assign class labels to specific pixels in a given input image. The rapid development of deep learning methods has resulted in significant progress in performance [85,17,83,49], stability [61,79], and accessibility [57,1] of semantic segmentation algorithms, often seen in real world applications such as autonomous vehicles [88,37,72,12,10,53], precision agriculture [5,4], medical diagnosis [82,73,39,60,67], image restoration and editing [54,44], sports [18], and remote sensing [6,41]. While such algorithms provide insightful information about the scene, it requires large amounts of pixel-wise labeled data [87,48,23], which is often expensive and time consuming to collect [9].…”