“…Autoencoder networks, however, make it difficult to attain intuitive parametric control of the animation. Recent approaches aim to overcome this challenge using a keypointdriven motion flow field ] or a 3DMM [Ren et al 2021] to drive the portrait image edits. Yet, all these network architectures are oblivious to the underlying 3D structure of the content, making it challenging to produce photorealistic outputs across a range of viewpoints.…”