“…As many environments, particularly indoor scenes, have been designed for people's daily usage, human behavioral priors can be leveraged to additionally reason about 2D or 3D scene observations. Various methods have been proposed to leverage human context as extra signal towards holistic perception to improve performance in scene understanding tasks such as semantic segmentation [11], layout detection from images [17,60], 3D object labeling [30], 3D object detection and segmentation [66], and 3D reconstruction [18,19].…”