2020
DOI: 10.1109/access.2020.3020868

Generating Biomedical Question Answering Corpora From Q&A Forums

Abstract: Question Answering (QA) is a natural language processing task that aims at obtaining relevant answers to user questions. While some progress has been made in this area, biomedical questions are still a challenge to most QA approaches, due to the complexity of the domain and limited availability of training sets. We present a method to automatically extract question-article pairs from Q&A web forums, which can be used for document retrieval, a crucial step of most QA systems. The proposed framework extracts fro…

Citations: cited by 9 publications (4 citation statements)
References: 23 publications
“…PubMedQA (Jin et al., 2019) created a QA dataset that can be used as yes/no or query-focused summarization. Cloze style QA datasets are also proposed in the domain of BioNLP (Kim et al., 2018; Lamurias et al., 2020; Pappas et al., 2020).…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…To test our framework, we made adjustments (see Appendix A) to four biomedical datasets: BioASQ (Lamurias et al, 2020), COVID-QA (Möller et al, 2020), cpgQA (Mahbub et al, 2023) and SleepQA (Bojic et al, 2022). We refer the reader to Table 1 for statistics on the final version of datasets that we used in all experiments: original/final size of text corpus, original/final number of labels and finally, train/dev/test split.…”
Section: Datasets
Citation type: mentioning
confidence: 99%
“…PubMedQA (Jin et al., 2019) created a QA dataset that can be used as yes/no or query-focused summarization. Cloze style QA datasets are also proposed in the domain of BioNLP (Kim et al., 2018; Lamurias et al., 2020; Pappas et al., 2020).…”
Section: Questions For Question Answering
Citation type: mentioning
confidence: 99%