“…Most previous works tackle 3D hand pose estimation [17,25,40,50,47] and object pose estimation [27,31,44,49] separately. Recently, joint hand-object pose estimation has received more attention [14,26,28,12,8,13,11] due to the strong correlation between the two when hands interact with objects. Among learning-based methods, Hasson et al. [14] propose attraction and repulsion losses to penalize physically implausible reconstructions.…”