“…There has been great progress over recent decades in reconstructing or estimating the pose of a single hand [KS12, GRL*19, IMB*18, CCY*21, ZLM*19] or of objects [HHFS19, KMT*17, PLH*19, ZSI19, LF20, ZHMW22, LZXQ21, YJLF22, CG22, ZBB21, SHCM21] alone. Lacking good datasets that label hands and objects together, early work on hand‐object interaction focused on recovering either the hand [RKK09, RKI*14] or the object [TG15] pose in an interaction.…”