“…To facilitate geometric learning, several statistical parametric templates have been developed for faces [31], hands [48,61], and minimally clothed bodies [28,41,51,54]. To acquire animatable characters wearing casual clothes, traditional pipelines mostly reconstruct a subject-specific mesh template in advance and then generate its motions using physics simulation [20,71], deformation space modeling [28], or deep learning [6,22,79]. The reliance on pre-scanning can be eliminated by deforming a general body template, and several works have proposed to learn this deformation directly from geometric data [42,43,44,57] or RGB videos [2,3,4,5].…”