“…Stereo input was similarly explored [24,40,47]. With the wide availability of commodity depth sensors around 2010, the research also focused on monocular depth or RGBD input [10,20,29,31,36,37,43,55,56,57,61]. However, shortly thereafter, the success of deep learning lead to the proliferation of robust systems that perform on monocular RGB input [3,4,9,11,13,17,22,34,41,48,53,54,58,62,63].…”