2022
DOI: 10.1016/j.jbi.2022.103999
|View full text |Cite
|
Sign up to set email alerts
|

COVIDSum: A linguistically enriched SciBERT-based summarization model for COVID-19 scientific papers

Abstract: Graphical abstract COVIDSum ( COVID -19 scientific paper Sum marization) consists of four major modules: (1) Dataset Preprocessing, (2) Heuristic Sentence Extraction, (3) Word Cooccurrence Graph Construction, and (4) Linguistically Enriched Abstractive Summarization. The Data Preprocessing module retrieves abstract and textual content of each paper and removes papers which have missed abstracts or are not written in English languag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 26 publications
(29 citation statements)
references
References 23 publications
0
29
0
Order By: Relevance
“…For example, given the strong ability of ChatGPT to extract key points and understand sentences, we can foresee potential promising results in text summarization. Specifically, ChatGPT might be valuable for domain-specific science paper summarization [86] and clinical report summarization [87]. Publicly available domain-specific science paper summarization datasets and clinical report datasets are rare and are often provided at small scales due to privacy concerns and the need for expert knowledge to generate annotated summaries.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…For example, given the strong ability of ChatGPT to extract key points and understand sentences, we can foresee potential promising results in text summarization. Specifically, ChatGPT might be valuable for domain-specific science paper summarization [86] and clinical report summarization [87]. Publicly available domain-specific science paper summarization datasets and clinical report datasets are rare and are often provided at small scales due to privacy concerns and the need for expert knowledge to generate annotated summaries.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Since the majority of the preceding study evaluating transformer-based methods uses truncated documents [ 7 , 10 – 12 , 21 , 37 ], a corpus was also constructed for evaluating the proposed method by truncating long articles to 512 words long. In consideration that most of the salient knowledge of a research article is documented at the start of the research article, starting from the top of the article, we successively assimilate sentences to form one paragraph until 512 words long [ 1 ].…”
Section: Proposed Methodologymentioning
confidence: 99%
“…Globally, the SARS-CoV-2 virus had a destructive effect on communities since the upsurge of the COVID-19 pandemic in November 2019 [ 1 ]. Medical communities and researchers are under increased pressure to remain current with the articles due to the rapid growth of research articles [ 2 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the ClinicalRadioBERT is a specialized language model for radiation oncology [67]. Other applications include radiology report summarization [68], mental disorder detection [69], COVID-19 research summarization [70], and clinical information extraction [71]. However, the scale of existing language models is not comparable to recent developments such as ChatGPT and GPT-4.…”
Section: Applications Of Chatgpt/gpt-4mentioning
confidence: 99%