“…With the development of deep neural networks (DNNs), early methods [1,14,25,48] formulated pose estimation as a regression problem, directly mapping the input image to the 6D object pose. More recently, most works [5, 20,22,32,34,37,38,40,41,42,43] draw inspiration from geometry and seek to predict 2D-3D correspon- pose Figure 1. Difference between other differentiable PnP losses and our proposed loss.…”