“…Some methods, e.g. [45], fuse the sparse depth and RGB image via early fusion while others [18,26,29,37,44,64] utilize a late fusion scheme, or jointly utilize both the early and late fusion [36,65,68]. Another line of research focuses on utilizing affinity or geometric information of the scene via surface normal, occlusion boundaries, and the geometric convolutional layer [11,12,25,34,52,54,77,86].…”