Findings of the Association for Computational Linguistics: EMNLP 2021 2021
DOI: 10.18653/v1/2021.findings-emnlp.392
|View full text |Cite
|
Sign up to set email alerts
|

Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework

Abstract: Answering questions asked from instructional corpora such as E-manuals, recipe books, etc., has been far less studied than open-domain factoid context-based question answering. This can be primarily attributed to the absence of standard benchmark datasets. In this paper we meticulously create a large amount of data connected with E-manuals and develop suitable algorithm to exploit it. We collect E-Manual Corpus, a huge corpus of 307,957 E-manuals and pretrain RoBERTa on this large corpus. We create various ben… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…Different from previous multimodal inputs, the product manual is a specific domain in terms of the question type and the content. Since product manuals usually contain detailed operation instructions for a specific device, the questions beginning with 'How to' are very common (Nandy et al 2021), while this type of contents and questions rarely occur in general domain datasets. Moreover, the answers in the abovementioned works are all in text format, including text span, multi-choice, and generative sentences.…”
Section: Multimodal Question Answeringmentioning
confidence: 99%
See 4 more Smart Citations
“…Different from previous multimodal inputs, the product manual is a specific domain in terms of the question type and the content. Since product manuals usually contain detailed operation instructions for a specific device, the questions beginning with 'How to' are very common (Nandy et al 2021), while this type of contents and questions rarely occur in general domain datasets. Moreover, the answers in the abovementioned works are all in text format, including text span, multi-choice, and generative sentences.…”
Section: Multimodal Question Answeringmentioning
confidence: 99%
“…The product manuals in PM209 are from two sources: 1) E-manual corpus (Nandy et al 2021); 2) official websites of the brands.…”
Section: A Product Manual Collectionmentioning
confidence: 99%
See 3 more Smart Citations