Sentence Centrality Revisited for Unsupervised Summarization

Zheng, Hao; Lapata, Mirella

doi:10.18653/v1/p19-1628

Cited by 149 publications

(157 citation statements)

References 33 publications

Supporting

Mentioning

153

Contrasting

Unclassified


“…(2) Unsupervised extractive systems: TextRank (Mihalcea and Tarau, 2004), Lead-X. (3) Supervised abstractive and abstractive (models trained with groundtruths summaries): PACSUM (Zheng and Lapata, 2019), PGNet (See et al, 2017), REFRESH (Narayan et al, 2018) and SUMO (Liu et al, 2019b). TED is unsupervised abstractive and therefore not directly comparable with supervised baselines.…”

Section: Baseline and Metrics (mentioning)

confidence: 99%

“…The centrality of a node (sentence) is computed by PageRank (Brin and Page, 1998) to decide whether a sentence should be included in the final summary. Zheng and Lapata (2019) advances upon TextRank by encoding sentences with BERT representation (Devlin et al, 2018) to compute pairs similarity and build graphs with directed edges decided by the relative positions of sentences.…”

Section: Introduction (mentioning)

confidence: 99%
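The PageRank-based centrality described in the statement above can be illustrated with a minimal sketch. It assumes a precomputed, non-negative similarity matrix `sim` (hypothetical input; the source does not say how similarities are obtained in this setting), and the function names, damping factor, and stopping rule are illustrative choices rather than details from any of the cited papers.

```python
def pagerank(sim, damping=0.85, iters=100, tol=1e-8):
    """Power-iteration PageRank over a weighted sentence graph.

    sim[i][j] is a non-negative similarity between sentences i and j.
    Self-loops are ignored. All parameter values are illustrative.
    """
    n = len(sim)
    # Total outgoing weight of each node, excluding the diagonal.
    out_sum = [sum(sim[i][j] for j in range(n) if j != i) for i in range(n)]
    scores = [1.0 / n] * n
    for _ in range(iters):
        new = []
        for j in range(n):
            # Score flowing into j, split in proportion to edge weights.
            incoming = sum(
                scores[i] * sim[i][j] / out_sum[i]
                for i in range(n)
                if i != j and out_sum[i] > 0
            )
            new.append((1 - damping) / n + damping * incoming)
        delta = sum(abs(a - b) for a, b in zip(new, scores))
        scores = new
        if delta < tol:
            break
    return scores


def top_k(scores, k):
    """Indices of the k highest-scoring sentences, in document order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Toy graph: sentence 1 is similar to both of the other sentences.
toy_sim = [[0, 1, 0],
           [1, 0, 1],
           [0, 1, 0]]
toy_scores = pagerank(toy_sim)
summary_ids = top_k(toy_scores, 1)
```

In this toy graph the middle sentence is the only one connected to both others, so it receives the highest score and is the one selected.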


TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising

Yang, Zhu, Gmyr et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020


Text summarization aims to extract essential information from a piece of text and transform the text into a concise version. Existing unsupervised abstractive summarization models leverage recurrent neural networks framework while the recently proposed transformer exhibits much more capability. Moreover, most of previous summarization models ignore abundant unlabeled corpora resources available for pretraining. In order to address these issues, we propose TED, a transformer-based unsupervised abstractive summarization system with pretraining on large-scale data. We first leverage the lead bias in news articles to pretrain the model on millions of unlabeled corpora. Next, we finetune TED on target domains through theme modeling and a denoising autoencoder to enhance the quality of generated summaries. Notably, TED outperforms all unsupervised abstractive baselines on NYT, CNN/DM and English Gigaword datasets with various document styles. Further analysis shows that the summaries generated by TED are highly abstractive, and each component in the objective function of TED is highly effective.

“…Then PageRank (Page et al, 1999) is employed to determine the final ranking scores for sentences. Zheng and Lapata (2019) builds directed graph by utilizing BERT (Devlin et al, 2019) to compute sentence similarities. The importance score of a sentence is the weighted sum of all its out edges, where weights for edges between the current sentence and preceding sentences are negative.…”

Section: Related Work (mentioning)

confidence: 99%
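The position-aware scoring described in the statement above (a weighted sum over a sentence's edges, with negative weights for edges to preceding sentences) can be sketched as follows. This is only an illustration of that description: the similarity matrix `sim`, the function name, and the weights `w_back` and `w_fwd` are hypothetical, and any thresholding or normalization used in the original paper is omitted.

```python
def directed_centrality(sim, w_back=-0.5, w_fwd=1.0):
    """Score each sentence by a position-weighted sum of its edges.

    sim[i][j] is the similarity between sentences i and j (assumed
    precomputed, e.g. from sentence embeddings). Edges to preceding
    sentences (j < i) get the negative weight w_back; edges to
    following sentences (j > i) get w_fwd. Weight values are
    illustrative, not taken from the paper.
    """
    n = len(sim)
    scores = []
    for i in range(n):
        back = sum(sim[i][j] for j in range(i))          # earlier sentences
        fwd = sum(sim[i][j] for j in range(i + 1, n))    # later sentences
        scores.append(w_back * back + w_fwd * fwd)
    return scores


# Three sentences that are all equally similar to one another.
uniform_sim = [[0, 1, 1],
               [1, 0, 1],
               [1, 1, 0]]
centrality = directed_centrality(uniform_sim)
```

With uniform similarities the scores fall strictly with position, since later sentences accumulate more negatively weighted backward edges. This is one way to see why leading sentences tend to score highly under such a scheme.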

“…Thus, leading sentences tend to obtain high scores. Unlike Zheng and Lapata (2019), sentence positions are not explicitly modeled in our model and therefore our model is less dependent on sentence positions (as shown in experiments).…”

Section: Related Work (mentioning)

confidence: 99%


Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers

Xu, Zhang, Wu et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020


Unsupervised extractive document summarization aims to select important sentences from a document without using labeled summaries during training. Existing methods are mostly graph-based with sentences as nodes and edge weights measured by sentence similarities. In this work, we find that transformer attentions can be used to rank sentences for unsupervised extractive summarization. Specifically, we first pre-train a hierarchical transformer model using unlabeled documents only. Then we propose a method to rank sentences using sentence-level self-attentions and pre-training objectives. Experiments on CNN/DailyMail and New York Times datasets show our model achieves state-of-the-art performance on unsupervised summarization. We also find in experiments that our model is less dependent on sentence positions. When using a linear combination of our model and a recent unsupervised model explicitly modeling sentence positions, we obtain even better results.


Extraction and Portrait of Knowledge Points for Open Learning Resources

Jiang

et al. 2020

Lecture Notes in Computer Science

