Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.147

Improving Unsupervised Extractive Summarization with Facet-Aware Modeling

Abstract: Unsupervised extractive summarization aims to extract salient sentences from documents without a labeled corpus. Existing methods are mostly graph-based, computing sentence centrality to rank sentences. However, these methods tend to select sentences within the same facet, which leads to a facet bias problem, especially when the document has multiple facets (i.e., long documents and multi-document inputs). To address this problem, we propose a novel facet-aware centrality-based ranking model. We let the model pay more at…

Cited by 25 publications (29 citation statements) · References 51 publications

“…As there are many different ways to (a) encode or vectorize a sentence before calculating the similarity between them and (b) calculate the centrality score of each sentence, research involving this architecture often differs only in these two mechanisms. For example, with respect to the former mechanism, graph architecture in the past [29,83] encodes sentences based on word co-occurrence or term frequency-inverse document frequency (Tf-Idf), while graph architecture today [69,135] encodes sentences with state-of-the-art pre-trained models. On the other hand, to improve the centrality scoring mechanism, PacSum [135] and FAR [69] adjust the centrality score of a sentence based on whether the other sentences come before or after it, while HipoRank [25] exploits the discourse structure contained in the document by adjusting the centrality score with positional and sectional bias.…”
Section: Graph Architecture
confidence: 99%
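To make the two mechanisms in the statement above concrete, here is a minimal, hypothetical Python sketch of such a graph architecture: sentences are vectorized with TF-IDF (as in the earlier graph methods), pairwise cosine similarities form the edge weights, and each sentence is scored by a direction-weighted centrality sum. The function name rank_sentences and the lambda_before/lambda_after weights are illustrative assumptions; the actual PacSum, FAR, and HipoRank formulations differ in their details.

```python
# Hypothetical sketch of graph-based centrality ranking for extractive
# summarization; not the released PacSum/FAR/HipoRank code.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def rank_sentences(sentences, lambda_before=1.0, lambda_after=1.0, top_k=3):
    # (a) encode each sentence (here: TF-IDF vectors)
    vectors = TfidfVectorizer().fit_transform(sentences)
    # edge weights e_ij = cosine similarity between sentences i and j
    sim = cosine_similarity(vectors)
    scores = []
    for i in range(len(sentences)):
        before = sum(sim[i][j] for j in range(len(sentences)) if j < i)
        after = sum(sim[i][j] for j in range(len(sentences)) if j > i)
        # (b) direction-weighted centrality score
        scores.append(lambda_before * before + lambda_after * after)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in ranked[:top_k]]


if __name__ == "__main__":
    doc = [
        "The new model improves unsupervised extractive summarization.",
        "It builds a sentence graph from pairwise similarities.",
        "Centrality scores then determine which sentences are extracted.",
        "Unrelated filler sentence about the weather.",
    ]
    print(rank_sentences(doc, top_k=2))
```

Setting lambda_before and lambda_after to unequal values mimics the positional adjustment described above: edges to earlier and to later sentences then contribute differently to a sentence's score.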
“…For example, with respect to the former mechanism, graph architecture in the past [29,83] encodes sentences based on word co-occurrence or term frequency-inverse document frequency (Tf-Idf), while graph architecture today [69,135] encodes sentences with state-of-the-art pre-trained models. On the other hand, to improve the centrality scoring mechanism, PacSum [135] and FAR [69] adjust the centrality score of a sentence based on whether the other sentences come before or after it, while HipoRank [25] exploits the discourse structure contained in the document by adjusting the centrality score with positional and sectional bias. In general form, given the set of sentences in the original source document, D = {s_1, s_2, ..., s_m}, with the inter-sentential similarity relations represented as e_ij = (s_i, s_j) ∈ E where i ≠ j, the following illustrates the aforementioned architecture in computing the score for each sentence:…”
Section: Graph Architecture
confidence: 99%
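The quoted passage is cut off before the formula itself; as an illustrative reconstruction consistent with the description above (and not necessarily the citing paper's exact equation), a plain degree-centrality score and a direction-weighted variant in the PacSum/FAR spirit can be written as:

```latex
% Illustrative centrality scores over the sentence graph D = {s_1, ..., s_m}
% with edge weights e_ij; lambda_1 and lambda_2 are assumed tuning weights,
% not symbols taken from the cited papers.
\[
  \mathrm{centrality}(s_i) = \sum_{j \neq i} e_{ij},
  \qquad
  \mathrm{centrality}_{\mathrm{dir}}(s_i) =
  \lambda_1 \sum_{j < i} e_{ij} + \lambda_2 \sum_{j > i} e_{ij}.
\]
```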
“…One line of work is unsupervised extractive approaches, which are based on graphs [15,23,38], centrality [29,53], point-wise mutual information [37], or sentence-level self-attention in pre-trained models [45]. Another direction is unsupervised abstractive approaches, and these studies typically employ sequence-to-sequence autoencoding methods [9] with adversarial training and reinforcement learning [42].…”
Section: Patterns
confidence: 99%
“…Recent work has seen the emergence of larger scale datasets such as WikiSum, Multi-News (Fabbri et al., 2019), and WCEP (Gholipour Ghalandari et al., 2020) to combat data sparsity. Extractive (Wang et al., 2020b,c; Liang et al., 2021) and abstractive (Jin et al., 2020) methods have followed from these multi-document news datasets.…”
Section: Related Work
confidence: 99%