“…One of the areas in which this marriage is being fertile the most is the one of estimating the 3D shape of an object given an image, or to generate novel views of the same object. Indeed, pre deep learning methods [4,15,20,42,29,35] often need multiple views at test time and rely on the assumption that descriptors can be matched across views [13,1], handling poorly self-occlusions, lack of texture [32] and large viewpoint changes [25]. Conversely, more recent works [37,46,7,36,22,2] are built upon powerful deep learning models trained on virtually infinite synthetic data rendered from ShapeNet [5].…”