“…Extensive efforts have been spent on learning underlying low-dimensional representation and the generation process of high-dimensional data through deep generative models such as variational autoencoders (VAE) [27,35,9], generative adversarial networks (GANs) [11,12], normalizing flows [40,5], etc [48,17,8]. Particularly, enhancing the disentanglement and independence of latent dimensions has been attracting the attention of the community [4,43,3,34,45,23], enabling controllable generation that generates data with desired properties by interpolating latent variables [44,13,29,25,38,20,7,49]. For instance, CSVAE transfers image attributes by correlating latent variables with desired properties [28].…”