Disentangling Geometry and Appearance with Regularised Geometry-Aware Generative Adversarial Networks

Tran, Linh; Kossaifi, Jean; Panagakis, Yannis; Pantić, Maja

doi:10.1007/s11263-019-01155-7

Cited by 18 publications

(16 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…[26]. While the majority of existing work on disentanglement focuses on a (semi-)supervised setting [27][28][29][30][31][32][33][34], our work focuses on the unsupervised seeting. Here, we review the most closely related methods below.…”

Section: Comparison With Other Modelsmentioning

confidence: 99%

Unsupervised Controllable Generation with Self-Training

Chrysos

Kossaifi

et al. 2021

2021 International Joint Conference on Neural Networks (IJCNN)

Self Cite

View full text Add to dashboard Cite

Recent generative adversarial networks (GANs) are able to generate impressive photo-realistic images. However, controllable generation with GANs remains a challenging research problem. Achieving controllable generation requires semantically interpretable and disentangled factors of variation. It is challenging to achieve this goal using simple fixed distributions such as Gaussian distribution. Instead, we propose an unsupervised framework to learn a distribution of latent codes that control the generator through self-training. Self-training provides an iterative feedback in the GAN training, from the discriminator to the generator, and progressively improves the proposal of the latent codes as training proceeds. The latent codes are sampled from a latent variable model that is learned in the feature space of the discriminator. We consider a normalized independent component analysis model and learn its parameters through tensor factorization of the higher-order moments. Our framework exhibits better disentanglement compared to other variants such as the variational autoencoder, and is able to discover semantically meaningful latent codes without any supervision. We demonstrate empiracally on both cars and faces datasets that each group of elements in the learned code controls a mode of variation with a semantic meaning, e.g. pose or background change. We also demonstrate with quantitative metrics that our method generates better results compared to other approaches.Preprint. Under review.

show abstract

Section: Comparison With Other Modelsmentioning

confidence: 99%

Unsupervised Controllable Generation with Self-Training

Chrysos

Kossaifi

et al. 2021

2021 International Joint Conference on Neural Networks (IJCNN)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Deep learning techniques have gained increased attention as their design requires minimal prior knowledge and the models can be fine-tuned to scale to different environments [ 20 ]. These models have been enhanced using recurrent neural networks (RNN) that memorize long-term dependencies and tackle autonomous driving as partially observable Markov decision processes (POMDP) [ 21 ].…”

Section: Literature Reviewmentioning

confidence: 99%

“…All past and present states are in the continuous state-space [ 34 ]. The parameter values, that is, the set of velocities in a given timestep, at present and past instances for a given state S are obtained for the trajectory followed by the vehicle described by [ 20 , 40 ]:

where

is the difference between two subsequent timeframes while the vehicle navigates the trajectory. These parameters are used to calculate the optimal value function

and optimal Q-value

.…”

Section: Problem Formulationmentioning

confidence: 99%

See 1 more Smart Citation

Policy-Gradient and Actor-Critic Based State Representation Learning for Safe Driving of Autonomous Vehicles

Gupta

Khwaja

Anpalagan

et al. 2020

Sensors

View full text Add to dashboard Cite

In this paper, we propose an environment perception framework for autonomous driving using state representation learning (SRL). Unlike existing Q-learning based methods for efficient environment perception and object detection, our proposed method takes the learning loss into account under deterministic as well as stochastic policy gradient. Through a combination of variational autoencoder (VAE), deep deterministic policy gradient (DDPG), and soft actor-critic (SAC), we focus on uninterrupted and reasonably safe autonomous driving without steering off the track for a considerable driving distance. Our proposed technique exhibits learning in autonomous vehicles under complex interactions with the environment, without being explicitly trained on driving datasets. To ensure the effectiveness of the scheme over a sustained period of time, we employ a reward-penalty based system where a negative reward is associated with an unfavourable action and a positive reward is awarded for favourable actions. The results obtained through simulations on DonKey simulator show the effectiveness of our proposed method by examining the variations in policy loss, value loss, reward function, and cumulative reward for `VAE+DDPG’ and `VAE+SAC’ over the learning process.

show abstract

“…Conditional signal generation leverages a conditioning label, e.g. a prior shape (Tran et al 2019) or an embedded representation (Mirza and Osindero 2014), to produce the target signal. In this work, we focus on the latter setting, i.e.…”

Section: Conditional Ganmentioning

confidence: 99%

RoCGAN: Robust Conditional GAN

2020

Self Cite

View full text Add to dashboard Cite

Conditional image generation lies at the heart of computer vision and conditional generative adversarial networks (cGAN) have recently become the method of choice for this task, owing to their superior performance. The focus so far has largely been on performance improvement, with little effort in making cGANs more robust to noise. However, the regression (of the generator) might lead to arbitrarily large errors in the output, which makes cGANs unreliable for real-world applications. In this work, we introduce a novel conditional GAN model, called RoCGAN, which leverages structure in the target space of the model to address the issue. Specifically, we augment the generator with an unsupervised pathway, which promotes the outputs of the generator to span the target manifold, even in the presence of intense noise. We prove that RoCGAN share similar theoretical properties as GAN and establish with both synthetic and real data the merits of our model. We perform a thorough experimental validation on large scale datasets for natural scenes and faces and observe that our model outperforms existing cGAN architectures by a large margin. We also empirically demonstrate the performance of our approach in the face of two types of noise (adversarial and Bernoulli).

show abstract

Disentangling Geometry and Appearance with Regularised Geometry-Aware Generative Adversarial Networks

Cited by 18 publications

References 37 publications

Unsupervised Controllable Generation with Self-Training

Unsupervised Controllable Generation with Self-Training

Policy-Gradient and Actor-Critic Based State Representation Learning for Safe Driving of Autonomous Vehicles

RoCGAN: Robust Conditional GAN

Contact Info

Product

Resources

About