“…This problem falls into the general image-to-image translation literature [41,64]. Indeed, some might recall prior arts (e.g., pix2pix [41], CycleGAN [105], MUNIT [38], Bi-cycleGAN [106]), and sketch-specific variants [33,86] primarily based on pix2pix [41] claiming to have tackled the exact problem. We are strongly inspired by these works, but significantly differ on one key aspect -we aim to generate from abstract human sketches, not accurate photo edgemaps which are already "photorealistic".…”