“…NST, first introduced in Gatys et al. [21], can be optimization-based [9,21,22] or model-based [16,28,30,51]. It is inherently defined in the image domain by matching CNN features either globally or in a local, patch-based manner [21,31,34,36,41]. Thus, it cannot directly utilize 3D data like depth or surface normals of a mesh.…”