“…Multi-task learning with the goal of learning multiple tasks simultaneously [ 40 ] is nowadays based on CNNs [ 13 , 15 , 16 , 17 , 18 , 19 , 20 , 41 , 42 , 43 , 44 , 44 ]. Eigen and Fergus proposed a network structure that can estimate the depth, semantic labels, and the surface orientation of a scene [ 13 ].…”