“…With advances in deep networks, Eigen et al [2] showed that monocular depth estimation with deep neural network yields significant gains in accuracy and speed compared to the traditional attempts with handcrafted features. Starting with the work of Eigen et al [2], many learning-based methods [1,3,4,5,6,7,8,9,10,11] have been studied and showed high accuracy in overall depth evaluation metrics. However, they still produce noisy and temporally flickering depth maps because they perform depth estimation on each frame independently or do not use temporal information correctly.…”