“…D ILATED convolutions, also known as atrous convolutions, have been widely explored in deep convolutional neural networks (DCNNs) for various tasks, including semantic image segmentation [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], object detection [11], [12], [13], [14], audio generation [15], video modeling [16], and machine translation [17]. The idea of dilated filters was developed in the algorithm à trous for efficient wavelet decomposition in [18] and has been used in image pixel-wise prediction tasks to allow efficient computation [1], [2], [11], [12].…”