“…Moreover, a time-consuming global optimization is often adopted in order to improve depth accuracy [1, 3-6, 9, 11, 12]. Following the success of convolutional neural networks (ConvNets) in vision applications, some methods propose to adopt ConvNets to extract features for coarse estimates, followed by a global optimization [10,13] or refinement network [15] to improve the accuracy and/or speed. Recently, Epinet [14], a network trained end-to-end without postprocessing, achieves state-of-the-art accuracy onto the HCI and CVIA-HCI datasets [17,18].…”