2021
DOI: 10.48550/arxiv.2110.10874
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CNewSum: A Large-scale Chinese News Summarization Dataset with Human-annotated Adequacy and Deducibility Level

Abstract: Automatic text summarization aims to produce a brief but crucial summary for the input documents. Both extractive and abstractive methods have witnessed great success in English datasets in recent years. However, there has been a minimal exploration of text summarization in Chinese, limited by the lack of largescale datasets. In this paper, we present a large-scale Chinese news summarization dataset CNewSum, which consists of 304,307 documents and human-written summaries for the news feed. It has long document… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…We conduct experiments on CNNDM dataset (Hermann et al 2015), NYT dateset (Durrett, Berg-Kirkpatrick, and Klein 2016) and CNewSum dataset (Wang et al 2021)…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We conduct experiments on CNNDM dataset (Hermann et al 2015), NYT dateset (Durrett, Berg-Kirkpatrick, and Klein 2016) and CNewSum dataset (Wang et al 2021)…”
Section: Methodsmentioning
confidence: 99%
“…Comparison of unsupervised methods. For existing methods, the results of TextRank (SimCSE) is re-conducted by us, while other results are reported in existing papers(Zheng and Lapata 2019;Xu et al 2020b;Wang et al 2021). …”
mentioning
confidence: 99%