Proceedings of the 2nd Workshop on New Frontiers in Summarization 2019
DOI: 10.18653/v1/d19-5415
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Discourse-Level Segmentation for Extractive Summarization

Abstract: Extractive summarization selects and concatenates the most essential text spans in a document. Most, if not all, neural approaches use sentences as the elementary unit to select content for summarization. However, semantic segments containing supplementary information or descriptive details are often nonessential in the generated summaries. In this work, we propose to exploit discourse-level segmentation as a finer-grained means to more precisely pinpoint the core content in a document. We investigate how the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 23 publications
(19 citation statements)
references
References 25 publications
0
18
0
1
Order By: Relevance
“…C ¸elikyilmaz et al ( 2018) introduced a hierarchical model that handles the encoding phase through collaborating agents responsible for processing each text subsection. Liu and Chen (2019) and Xu et al (2020) proposed to exploit the discourse segmentation to extract the salient content for extractive summarization. Gidiotis and Tsoumakas (2020) introduced a divide-and-conquer approach that relies on structured documents to summarize each section independently.…”
Section: Related Workmentioning
confidence: 99%
“…C ¸elikyilmaz et al ( 2018) introduced a hierarchical model that handles the encoding phase through collaborating agents responsible for processing each text subsection. Liu and Chen (2019) and Xu et al (2020) proposed to exploit the discourse segmentation to extract the salient content for extractive summarization. Gidiotis and Tsoumakas (2020) introduced a divide-and-conquer approach that relies on structured documents to summarize each section independently.…”
Section: Related Workmentioning
confidence: 99%
“…In text summarization, most benchmark datasets focus on the news domain, such as NYT (Sandhaus, 2008) and CNN/Daily Mail (Hermann et al, 2015), where the human-written summaries are used in both abstractive and extractive paradigms (Gehrmann et al, 2018). To improve the performance of extractive summarization, non-neural approaches explore various linguistic and statistical features such as lexical characteristics (Kupiec et al, 1995), latent topic information (Ying-Lang Chang and Chien, 2009), discourse analysis (Hirao et al, 2015;Liu and Chen, 2019), and graphbased modeling (Erkan and Radev, 2004;Mihalcea and Tarau, 2004) . In contrast, neural approaches learn the features in a data-driven manner.…”
Section: In Relation To Other Workmentioning
confidence: 99%
“…Several studies [35,37] set up document graphs according to discourse analysis. But, this method relies on external tools, which possibly bring out semantically fragmented outputs [20]. To sum up, GAT (Graph Attention Network) is constructed using sentence context representation and topic information, and the docu-ment context vector and topic information are updated simultaneously.…”
Section: Introductionmentioning
confidence: 99%