Findings of the Association for Computational Linguistics: EMNLP 2020
DOI: 10.18653/v1/2020.findings-emnlp.256

Target Conditioning for One-to-Many Generation

Abstract: Neural Machine Translation (NMT) models often lack diversity in their generated translations, even when paired with a search algorithm such as beam search. A challenge is that the diversity in translations is caused by the variability in the target language, and cannot be inferred from the source sentence alone. In this paper, we propose to explicitly model this one-to-many mapping by conditioning the decoder of an NMT model on a latent variable that represents the domain of target sentences. The domain is a discre…
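The conditioning scheme described in the abstract can be pictured with a minimal sketch, assuming a Transformer-style NMT decoder; this is not the authors' implementation, and the module and parameter names below are hypothetical.

```python
import torch
import torch.nn as nn

class DomainConditionedDecoderInput(nn.Module):
    """Embeds target tokens and adds a learned embedding of a discrete
    latent 'domain' variable, so the decoder is conditioned on the domain."""
    def __init__(self, vocab_size, d_model, num_domains):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, d_model)
        self.domain_emb = nn.Embedding(num_domains, d_model)  # one vector per domain

    def forward(self, target_tokens, domain_id):
        # target_tokens: (batch, tgt_len) token ids; domain_id: (batch,) domain ids
        x = self.token_emb(target_tokens)
        # broadcast the domain embedding over every target position
        return x + self.domain_emb(domain_id).unsqueeze(1)
```

At inference time, decoding the same source once per domain id yields one translation per latent domain, which is the one-to-many behaviour the paper targets.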

Cited by 13 publications (9 citation statements) · References 24 publications

“…Generating multiple valid outputs given a source sequence has a wide range of applications, such as machine translation (Shen et al, 2019), paraphrase generation (Gupta et al, 2018), question generation (Cho et al, 2019), dialogue system (Dou et al, 2021), and story generation. For example, in machine translation, there are often many plausible and semantically equivalent translations due to information asymmetry between different languages (Lachaux et al, 2020).…”
Section: Diversity Promoting Text Generation (mentioning)
confidence: 99%
“…For example, nucleus sampling samples next tokens from the dynamic nucleus of tokens containing the vast majority of the probability mass, instead of decoding text by maximizing the likelihood. Another line of work focused on introducing random noise (Gupta et al, 2018) or changing latent variables (Lachaux et al, 2020) to produce uncertainty. In addition, Shen et al (2019) adopted a mixture of experts to diversify machine translation, where a minimum-loss predictor is assigned to each source input.…”
Section: Diversity Promoting Text Generation (mentioning)
confidence: 99%
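The nucleus sampling procedure summarized above can be sketched in a few lines; this is a generic top-p implementation (threshold value illustrative), not tied to any particular paper's code.

```python
import torch

def nucleus_sample(probs, p=0.9):
    """Sample a token id from the smallest set of top tokens whose cumulative
    probability exceeds p (the 'nucleus'), instead of taking the argmax."""
    sorted_probs, sorted_ids = torch.sort(probs, descending=True)
    cumulative = torch.cumsum(sorted_probs, dim=-1)
    # keep every token whose preceding cumulative mass is still below p
    keep = (cumulative - sorted_probs) < p
    keep[0] = True  # always keep the single most likely token
    nucleus = sorted_probs * keep
    nucleus = nucleus / nucleus.sum()  # renormalize within the nucleus
    choice = torch.multinomial(nucleus, num_samples=1)
    return sorted_ids[choice].item()
```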
“…Further, Lachaux et al (2020) replace the syntactic codes with latent domain variables derived from target sentences, which is more computationally efficient. Sun et al (2020) sample the encoder-decoder attention heads of Transformer to affect source word selection.…”
Section: Related Work (mentioning)
confidence: 99%
“…Diverse Text Generation. Generating diverse sequences is of crucial importance in many text generation applications that exhibit semantically one-to-many relationships between the source and target sequences, such as machine translation (Shen et al, 2019; Lachaux et al, 2020), summarization (Cho et al, 2019), question generation, and paraphrase generation (Qian et al, 2019). Methods of improving diversity in text generation have been widely explored from different perspectives in recent years.…”
Section: Related Work (mentioning)
confidence: 99%
“…Sampling-based decoding is one of the effective solutions to improve diversity (Fan et al, 2018; Holtzman et al, 2020), e.g., nucleus sampling (Holtzman et al, 2020) samples next tokens from the dynamic nucleus of tokens containing the vast majority of the probability mass, instead of aiming to decode text by maximizing the likelihood. Another line of work focuses on introducing random noise (Gupta et al, 2018) or changing latent variables (Lachaux et al, 2020) to produce uncertainty, e.g., Gupta et al (2018) employ a variational auto-encoder framework to generate diverse paraphrases according to the input noise. In addition, Shen et al (2019) adopt a deep mixture of experts (MoE) to diversify machine translation, where a minimum-loss predictor is assigned to each source input; Shi et al (2018) employ inverse reinforcement learning for unconditional diverse text generation.…”
Section: Related Work (mentioning)
confidence: 99%
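The minimum-loss expert assignment mentioned for the mixture-of-experts approach can be illustrated with a schematic training step; the `model(src, tgt, expert_id=...)` interface returning a scalar loss is an assumption made here for illustration, not the original implementation.

```python
import torch

def hard_moe_step(model, src, tgt, num_experts, optimizer):
    """One hard-assignment mixture-of-experts step: evaluate every expert,
    pick the expert with the lowest loss on this example, and update the
    model only through that expert's loss."""
    with torch.no_grad():
        losses = torch.stack([model(src, tgt, expert_id=k) for k in range(num_experts)])
    best = int(torch.argmin(losses))  # minimum-loss predictor for this source
    loss = model(src, tgt, expert_id=best)  # recompute with gradients enabled
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return best, loss.item()
```

Decoding with each expert at test time then produces a diverse set of hypotheses for the same source.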