Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1041
Disentangled Representation Learning for Non-Parallel Text Style Transfer

Abstract: This paper tackles the problem of disentangling the latent variables of style and content in language models. We propose a simple yet effective approach, which incorporates auxiliary multi-task and adversarial objectives, for label prediction and bag-of-words prediction, respectively. We show, both qualitatively and quantitatively, that the style and content are indeed disentangled in the latent space. This disentangled latent representation learning method is applied to style transfer on non-parallel corpora. …
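The abstract describes two auxiliary objectives on the latent space: a multi-task loss (predict the style label from the style latent) and an adversarial loss (an adversary tries to recover content, as bag-of-words, from the style latent, and the encoder learns to defeat it). The following is a minimal sketch of those two objectives, under assumed tensor shapes and module names; it is an illustration of the technique, not the authors' released implementation.

```python
# Sketch of multi-task + adversarial disentanglement objectives (hypothetical
# dimensions; not the paper's exact architecture or hyperparameters).
import torch
import torch.nn as nn
import torch.nn.functional as F

STYLE_DIM, NUM_STYLES, VOCAB = 8, 2, 10000

style_clf = nn.Linear(STYLE_DIM, NUM_STYLES)  # multi-task head: style label from style latent
bow_adv = nn.Linear(STYLE_DIM, VOCAB)         # adversary: bag-of-words (content) from style latent

def disentanglement_losses(z_style, style_label, bow_target):
    """z_style: (B, STYLE_DIM); style_label: (B,) long;
    bow_target: (B, VOCAB) normalized word counts."""
    # Multi-task objective: the style latent must predict the style label.
    multitask = F.cross_entropy(style_clf(z_style), style_label)
    # Adversary objective: trained on the detached latent to recover the
    # sentence's bag-of-words distribution from the style latent.
    adv_logits = bow_adv(z_style.detach())
    adversary = -(bow_target * F.log_softmax(adv_logits, dim=-1)).sum(-1).mean()
    # Encoder objective: fool the adversary by maximizing the entropy of its
    # prediction, pushing content information out of the style latent.
    log_p = F.log_softmax(bow_adv(z_style), dim=-1)
    entropy = -(log_p.exp() * log_p).sum(-1).mean()
    return multitask, adversary, -entropy
```

In a full training loop, the encoder's loss would add the multi-task term and the negative entropy to the usual reconstruction and KL terms, while the adversary is updated separately on the detached latent.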

Cited by 236 publications (276 citation statements) · References 27 publications
“…Hu et al. (2017) propose a new neural generative model which combines variational auto-encoders and holistic attribute discriminators for the effective imposition of semantic structures. Following their work, many methods (Fu et al., 2018; John et al., 2018; Zhang et al., 2018a,b) have been proposed based on the standard encoder-decoder architecture.…”
Section: Related Work
confidence: 99%
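For the combination this statement describes, a minimal sketch of a VAE objective augmented with an attribute-discriminator term, in the spirit of Hu et al. (2017), might look as follows; the names (x_logits, attr_logits, lam) and weighting are illustrative assumptions, not taken from the paper.

```python
# Sketch: VAE loss plus an attribute-discriminator term (hypothetical shapes).
import torch
import torch.nn.functional as F

def vae_attr_loss(x_logits, x_tokens, mu, logvar, attr_logits, attr_label,
                  beta=1.0, lam=0.1):
    # VAE terms: token-level reconstruction plus KL to a unit Gaussian prior.
    rec = F.cross_entropy(x_logits.transpose(1, 2), x_tokens)  # (B,T,V) vs (B,T)
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    # Discriminator term: the generated sentence should be classified as
    # carrying the intended attribute (e.g., sentiment).
    attr = F.cross_entropy(attr_logits, attr_label)
    return rec + beta * kl + lam * attr
```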
“…where p(w|z_syn) is the predicted distribution. BoW has been explored in previous work (Weng et al., 2017; John et al., 2018), showing a good ability to preserve semantics. For the syntactic space, the multi-task loss trains a model to predict syntax on z_syn.…”
Section: Disentangling Syntax and Semantics into Different Latent Spaces
confidence: 99%
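The equation this statement refers to is truncated in the excerpt. A standard bag-of-words multi-task loss of the kind being described, written in its usual generic form (a hedged reconstruction; the cited paper's exact equation may differ), is:

```latex
% Generic bag-of-words multi-task loss: z is the latent code,
% x the input sentence, W and b the parameters of a softmax predictor.
\mathcal{L}_{\mathrm{BoW}}(z) = -\sum_{w \in x} \log p(w \mid z),
\qquad
p(\,\cdot \mid z) = \operatorname{softmax}(W z + b)
```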
“…In particular, we introduce two continuous latent variables to capture semantics and syntax, respectively. To separate the semantic and syntactic information from each other, we borrow the adversarial approaches from text style-transfer research (Hu et al., 2017; Fu et al., 2018; John et al., 2018), but adapt them to our scenario of syntactic modeling. We also observe that syntax and semantics are highly interwoven, and therefore further propose an adversarial reconstruction loss to regularize the syntactic and semantic spaces.…”
Section: Introduction
confidence: 99%
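One plausible form of the adversarial reconstruction loss this statement mentions is an auxiliary decoder that tries to reconstruct the full sentence from one latent space alone, with the encoder penalized when it succeeds. The sketch below illustrates that pattern under assumed shapes and names (AdvReconstruction, z_syn); it is a generic instance of the technique, not the cited paper's architecture.

```python
# Sketch: adversarial reconstruction regularizer on the syntactic latent.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdvReconstruction(nn.Module):
    def __init__(self, z_dim=64, vocab=10000, hidden=256):
        super().__init__()
        self.rnn = nn.GRU(z_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab)

    def forward(self, z_syn, seq_len):
        # Feed the syntactic code at every step of an auxiliary decoder.
        steps = z_syn.unsqueeze(1).expand(-1, seq_len, -1).contiguous()
        h, _ = self.rnn(steps)
        return self.out(h)  # (B, seq_len, vocab) logits

def adversary_step(adv, z_syn, tokens):
    # Adversary minimizes reconstruction loss from the detached code.
    logits = adv(z_syn.detach(), tokens.size(1))
    return F.cross_entropy(logits.transpose(1, 2), tokens)

def encoder_penalty(adv, z_syn, tokens):
    # Encoder maximizes the adversary's loss, so that z_syn alone
    # cannot reconstruct the full sentence (content stays elsewhere).
    logits = adv(z_syn, tokens.size(1))
    return -F.cross_entropy(logits.transpose(1, 2), tokens)
```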
“…There are other perspectives on what constitutes a disentangled representation not addressed in this paper [1], [16], including definitions that are not statistical in nature but instead take into account the manifold structure and symmetry transformations in data [1], [12], [20]. Other deep learning approaches to disentangling include the adversarial setting [21]–[23]. Disentangled representations have also been studied in supervised and semi-supervised contexts [24].…”
Section: Discussion
confidence: 99%
“…We can calculate the model posterior distribution p_θ(z|x) at the network optimum, eqs. (22) and (23). Using Bayes' rule we find (see Appendix B-D)…”
Section: B. Optimal β Values in an Analytically Tractable Model
confidence: 91%
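The Bayes'-rule step this statement invokes takes the standard form below (the generic identity only; the specific optimum is given by the cited paper's eqs. (22) and (23), which are not reproduced in the excerpt):

```latex
% Bayes' rule for the model posterior over latents z given data x:
p_\theta(z \mid x) \;=\; \frac{p_\theta(x \mid z)\, p(z)}{p_\theta(x)},
\qquad
p_\theta(x) \;=\; \int p_\theta(x \mid z)\, p(z)\, \mathrm{d}z
```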