“…In text summarization, most benchmark datasets focus on the news domain, such as NYT (Sandhaus, 2008) and CNN/Daily Mail (Hermann et al, 2015), where the human-written summaries are used in both abstractive and extractive paradigms (Gehrmann et al, 2018). To improve the performance of extractive summarization, non-neural approaches explore various linguistic and statistical features such as lexical characteristics (Kupiec et al, 1995), latent topic information (Ying-Lang Chang and Chien, 2009), discourse analysis (Hirao et al, 2015;Liu and Chen, 2019), and graphbased modeling (Erkan and Radev, 2004;Mihalcea and Tarau, 2004) . In contrast, neural approaches learn the features in a data-driven manner.…”