Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-srw.30

SumPubMed: Summarization Dataset of PubMed Scientific Articles

Abstract: Most earlier work on text summarization is carried out on news article datasets. The summary in these datasets is naturally located at the beginning of the text. Hence, a model can spuriously utilize this correlation for summary generation instead of truly learning to summarize. To address this issue, we constructed a new dataset, SUMPUBMED, using scientific articles from the PubMed archive. We conducted a human analysis of summary coverage, redundancy, readability, coherence, and informativeness on SUMPUBMED.…

Citations: cited by 22 publications (13 citation statements)
References: 17 publications
“…NLP is also positioned to address challenges in interpreting documentation in the EHR by facilitating improved communication and understanding of clinician notes. NLP techniques centered around word embeddings have recently been utilized to develop question-answering (111–114), as well as summarizing large bodies of text such as clinical notes (52–54), and scientific publications (55, 115). With the advent of the Open Notes movement, a movement supporting transparent documentation among patients, families, and clinicians (116–118), and the 21st Century Cures Act of 2021 (119), which mandated patient accessibility to their clinical notes, there has been an increasing emphasis on patient involvement and advocacy in their own care.…”
Section: Discussion (mentioning)
confidence: 99%
“…been utilized to develop question-answering (111-114), as well as summarizing large bodies of text such as clinical notes (52-54), and scientific publications (55,115). With the advent of the Open Notes movement, a movement supporting transparent documentation among patients, families, and clinicians (116)(117)(118), and the 21st Century Cures Act of 2021 (119), which mandated patient accessibility to their clinical notes, there has been an increasing emphasis on patient involvement and advocacy in their own care.…”
Section: Discussion (mentioning)
confidence: 99%
“…Among them is the open-source data set of biomedical research papers collected by PubMed (46). This data set is widely used for a great variety of NLP tasks including text summarization (47) as well as general-purpose transformer pre-training (48). Furthermore, the vast amounts of social media data have proven resourceful in analyzing real-world data of post-acute COVID-19 (49).…”
Section: AI for Text and Sequence-based Infection Biology Data (mentioning)
confidence: 99%
“…QuestEval examines to what extent the questions from a candidate summary can be answered by its source, taking into account the named entities and nouns from the source document as the ground-truth answers 5 .…”
Section: B. Evaluation Metrics (mentioning)
confidence: 99%
“…the indications for surgery were an aortic stenosis with valve area less than . 5 Table IV shows that these abstractive summarization model can recapitulate the critical phrases as bullet points.…”
Section: E. Case Study (mentioning)
confidence: 99%