Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.641

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence

Abstract: The neural attention model has achieved great success in data-to-text generation tasks. Though usually excelling at producing fluent text, it suffers from the problems of missing information, repetition, and "hallucination". Due to the black-box nature of the neural attention architecture, avoiding these problems in a systematic way is non-trivial. To address this concern, we propose to explicitly segment the target text into fragment units and align them with their data correspondences. The segmentation and correspo…
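A minimal sketch of the idea described in the abstract, with invented records and fragments (not taken from the paper or its data): the target text is split into fragment units, each aligned to the data record it realizes, so missing or hallucinated records become easy to detect.

```python
# Hypothetical illustration of segmenting a target text into fragment units and
# aligning each unit with a source data record; all names/values are invented.

records = {
    "name": "Blue Spice",
    "eatType": "coffee shop",
    "area": "city centre",
}

# Each fragment is paired with the record it verbalizes (None = connective words
# that carry no data).
segmented_target = [
    ("Blue Spice", "name"),
    ("is a", None),
    ("coffee shop", "eatType"),
    ("in the city centre", "area"),
]

# Concatenating the fragments recovers the surface text, and the alignment lets
# us check that every record is realized exactly once.
text = " ".join(fragment for fragment, _ in segmented_target)
covered = {record for _, record in segmented_target if record is not None}
print(text)                     # Blue Spice is a coffee shop in the city centre
print(covered == set(records))  # True -> no missing or hallucinated records
```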

Cited by 38 publications (37 citation statements). References 50 publications (50 reference statements).
“…Previous methods usually treat graph-to-text generation as an end-to-end generation task. Those models (Trisedya et al., 2018; Gong et al., 2019; Shen et al., 2020) usually first linearize the knowledge graph and then use an attention mechanism to generate the description sentences. While the linearization of the input graph may sacrifice the inter-dependencies inside the input graph, some papers (Ribeiro et al., 2019, 2020a; Zhao et al., 2020) use graph encoders such as GCN (Duvenaud et al., 2015) and graph transformers (Koncel-Kedziorski et al., 2019) to encode the input graphs.…”
Section: Related Work (mentioning)
confidence: 99%
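As a rough illustration of the "linearize, then attend" pipeline this statement refers to (not code from any of the cited papers), a knowledge graph given as subject-relation-object triples can be flattened into a token sequence for a standard sequence-to-sequence attention model; the separator tokens <S>, <R>, <O> are an assumed convention.

```python
# Sketch of knowledge-graph linearization; the separator tokens are illustrative.
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)

def linearize(graph: List[Triple]) -> str:
    """Flatten triples into one string that a seq2seq model can consume."""
    return " ".join(f"<S> {s} <R> {r} <O> {o}" for s, r, o in graph)

graph = [
    ("Alan Turing", "birthPlace", "London"),
    ("Alan Turing", "field", "computer science"),
]
print(linearize(graph))
# <S> Alan Turing <R> birthPlace <O> London <S> Alan Turing <R> field <O> computer science
```

This flattening is exactly what discards the graph's inter-dependencies, which is the motivation the statement gives for GCN and graph-transformer encoders.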
“…While the linearization of the input graph may sacrifice the inter-dependencies inside the input graph, some papers (Ribeiro et al., 2019, 2020a; Zhao et al., 2020) use graph encoders such as GCN (Duvenaud et al., 2015) and graph transformers (Koncel-Kedziorski et al., 2019) to encode the input graphs. Others (Shen et al., 2020) try to carefully design loss functions to control the generation quality. With the development of computational resources, large-scale PLMs such as GPT-2 (Radford et al., 2019), BART (Lewis et al., 2020) and T5 (Raffel et al., 2020) achieve state-of-the-art results even with simple linearized graph input (Harkous et al., 2020; Chen et al., 2020a; Kale, 2020; Ribeiro et al., 2020b).…”
Section: Related Work (mentioning)
confidence: 99%
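A minimal sketch of the "large-scale PLM with linearized graph input" recipe mentioned above, using the Hugging Face transformers API. The generic t5-small checkpoint and the task prefix are assumptions for illustration; a real system would fine-tune the model on graph-to-text data.

```python
# Feed a linearized graph to an off-the-shelf T5 checkpoint and decode text.
# t5-small is NOT fine-tuned for graph-to-text; this only shows the API flow.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

linearized = "describe the graph: <S> Alan Turing <R> birthPlace <O> London"
inputs = tokenizer(linearized, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```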
“…However, it trades controllability for fluency. Similarly, Shen et al. (2020) explicitly segment the target text into fragment units while aligning them with their corresponding input. Shao et al. (2019) use a Hierarchical Variational Model to aggregate input items into a sequence of local latent variables and realize sentences conditioned on the aggregations.…”
Section: Related Work (mentioning)
confidence: 99%
“…In contrast, the plan used in other neural plan-based approaches is usually limited in terms of its interpretability, control, and expressivity. For example, in Moryossef et al. (2019b) and Zhao et al. (2020) the sentence plan is created independently, incurring error propagation; Wiseman et al. (2018) use latent segmentation, which limits interpretability; Shao et al. (2019) sample from a latent variable, not allowing for explicit control; and Shen et al. (2020) aggregate multiple input representations, which limits expressiveness. AGGGEN explicitly models the two planning processes (ordering and aggregation), and can directly influence the resulting plan and generated target text, using a separate inference algorithm based on dynamic programming.…”
Section: Introduction (mentioning)
confidence: 99%
“…The rise of pre-trained language models (Devlin et al., 2019; Radford et al., 2019) has led to strong text generation models for applications including summarization (Dong et al., 2019), paraphrasing (Goyal and Durrett, 2020; Shen et al., 2020), story generation (Mao et al., 2019), and data augmentation (Zhang and Bansal, 2019). However, while these models generate fluent and grammatical text, they are prone to making factual errors that contradict…”
Section: Introduction (mentioning)
confidence: 99%