“…Recent breakthroughs in deep generative models, especially Variational Autoencoders (VAEs) [27], Generative Adversarial Networks (GANs) [12], and their variants [22,40,7,28], open a new door to a myriad of fashion applications in computer vision, including fashion design [25,49], language-guided fashion synthesis [73,47,13], virtual try-on systems [15,59,5], and clothing-based appearance transfer [44,69]. Compared with generating images of rigid objects, fashion synthesis is more complicated because it involves multiple clothing items that form a compatible outfit.…”