1 We select the MLE models with the lowest negative log-likelihood and the MLE+RL models with the highest ROUGE-L scores on a sample of validation data to evaluate on the test set. At test time, we use beam search of width 5 on all our models to generate final predictions.

Model                                                           ROUGE-1  ROUGE-2  ROUGE-L
SummaRuNNer (Nallapati et al., 2017)                              39.60    16.20    35.30
graph-based attention (Tan et al., 2017)                          38.01    13.90    34.00
pointer-generator (See et al., 2017)                              36.44    15.66    33.42
pointer-generator + coverage (See et al., 2017)                   39.53    17.28    36.38
controlled summarization with fixed values (Fan et al., 2017)     39.75    17.29    36.54
RL, with intra-attention (Paulus et al., 2018)                    41.16    15.75    39.08
ML+RL, with intra-attention (Paulus et al., 2018)                 39

Model                                                           ROUGE-1  ROUGE-2  ROUGE-L
ML, no intra-attention (Paulus et al., 2018)                      44.26    27.43    40.41
RL, no intra-attention (Paulus et al., 2018)                      47.22    30.51    43.27
ML+RL, no intra-attention (Paulus et al., 2018)                   47
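As a rough illustration of the decoding procedure described above (beam search of width 5 keeping the highest-scoring partial sequences at each step), here is a minimal generic sketch. It is not the paper's implementation: `step_fn` is a hypothetical stand-in for the model's next-token distribution, and token names are illustrative.

```python
import math

def beam_search(step_fn, start_token, end_token, beam_width=5, max_len=20):
    """Generic beam-search decoder.

    step_fn(prefix) -> dict mapping each candidate next token to its
    probability given the prefix. At every step we keep only the
    `beam_width` partial sequences with the highest cumulative
    log-probability; a hypothesis is finished once it emits `end_token`.
    """
    beams = [([start_token], 0.0)]  # (token sequence, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, prob in step_fn(seq).items():
                candidates.append((seq + [tok], score + math.log(prob)))
        # Prune to the top `beam_width` hypotheses by score.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            if seq[-1] == end_token:
                finished.append((seq, score))
            else:
                beams.append((seq, score))
        if not beams:  # every surviving hypothesis has ended
            break
    finished.extend(beams)  # fall back to unfinished beams at max_len
    return max(finished, key=lambda c: c[1])[0]
```

With a width of 1 this reduces to greedy decoding; wider beams trade decoding time for a better search over output summaries.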