Towards automatically generating Questions under Discussion to link information and discourse structure

Kuthy, Kordula De; Kannan, Madeeswaran; Ponnusamy, Haemanth Santhi; Meurers, Detmar

doi:10.18653/v1/2020.coling-main.509

Cited by 4 publications

(23 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The generated question answer pairs in the QUACC corpus all satisfy the requirement of question answer congruence and as shown in De Kuthy et al (2020), this data set is a good source for training and testing of question generation approaches. De Kuthy et al ( 2020), Kannan et al (2021) trained word, character and subword seq2seq models successfully generating questions that satisfy question answer congruence.…”

Section: Corpus Creationmentioning

confidence: 95%

“…In the following, we will use this cleaned corpus to evaluate a number of models that have been shown to provide good results for the task of QG with QA congruence. Our goal is to show that our clean QUACC data set enables different types of neural models to produce questions of even higher quality compared to the numbers that were presented in De Kuthy et al (2020) and Kannan et al (2021) where various models were only trained and tested on the unclean QUACC.…”

Section: Training a Neural Classifiermentioning

confidence: 98%

“…It requires data sets that contain question-answer pairs with explicit question answer congruence. First approaches exploring question generation under the perspective of question answer congruence are presented in the work of De Kuthy et al ( 2020 ) and Kannan et al ( 2021 ). Based on a newly created data set several word-based, character, and subword seq2seq models are trained and tested that successfully generate questions satisfying question answer congruence, i.e., questions that can be answered with the sentences given in the input.…”

Section: Introductionmentioning

confidence: 99%

“…To address this lack of data, this paper introduces QUACC, the Question Answer Congruence Corpus, a corpus of 5.3 millions question-answer pairs obtained from a German newspaper corpus, designed explicitly for the task of QG with direct question answer congruence. A first version of this corpus was presented in De Kuthy et al ( 2020 ). While they focused on the quality of the neural question generation models, they did not further investigate the quality of the newly created data set.…”

Section: Introductionmentioning

confidence: 99%

“…While they focused on the quality of the neural question generation models, they did not further investigate the quality of the newly created data set. Since neural models are very sensitive to the quality of the data, some of the quality issues observed by De Kuthy et al ( 2020 ), such as generation of incorrect question words, seem to be related to the errors in the data set. We therefore developed method to clean the original QUACC data set which will be discussed in this article.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Exploring neural question generation for formal pragmatics: Data set and model evaluation

Kuthy

Kannan²,

Ponnusamy³

et al. 2022

Front. Artif. Intell.

Self Cite

View full text Add to dashboard Cite

We provide the first openly-available German QUestion-Answer Congruence Corpus (QUACC), designed for the task of sentence-based question generation with question-answer congruence. Based on this corpus, we establish suitable baselines for question generation, comparing systems of very different nature. Question generation is an interesting challenge in particular for current neural network architectures given that it combines aspects of language meaning and forms in complex ways. The systems have to generate question phrases appropriately linking to the meaning of the envisaged answer phrases, and they have to learn to generate well-formed questions using the source. We show that our QUACC corpus is well-suited to investigate the performance of various neural models and gain insights about the specific error sources.

show abstract

Section: Corpus Creationmentioning

confidence: 95%

Section: Training a Neural Classifiermentioning

confidence: 98%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Exploring neural question generation for formal pragmatics: Data set and model evaluation

Kuthy

Kannan²,

Ponnusamy³

et al. 2022

Front. Artif. Intell.

Self Cite

View full text Add to dashboard Cite

show abstract

Learning Enhancement Using Question-Answer Generation for e-Book Using Contrastive Fine-Tuned T5

Kumar¹,

Chauhan²,

C³

2022

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Conditional Generation with a Question-Answering Blueprint

Narayan

Maynez

Amplayo

et al. 2023

Transactions of the Association for Computational Linguistics

View full text Add to dashboard Cite

The ability to convey relevant and faithful information is critical for many tasks in conditional generation and yet remains elusive for neural seq-to-seq models whose outputs often reveal hallucinations and fail to correctly cover important details. In this work, we advocate planning as a useful intermediate representation for rendering conditional generation less opaque and more grounded. We propose a new conceptualization of text plans as a sequence of question-answer (QA) pairs and enhance existing datasets (e.g., for summarization) with a QA blueprint operating as a proxy for content selection (i.e., what to say) and planning (i.e., in what order). We obtain blueprints automatically by exploiting state-of-the-art question generation technology and convert input-output pairs into input-blueprint-output tuples. We develop Transformer-based models, each varying in how they incorporate the blueprint in the generated output (e.g., as a global plan or iteratively). Evaluation across metrics and datasets demonstrates that blueprint models are more factual than alternatives which do not resort to planning and allow tighter control of the generation output.

show abstract

Towards automatically generating Questions under Discussion to link information and discourse structure

Cited by 4 publications

References 37 publications

Exploring neural question generation for formal pragmatics: Data set and model evaluation

Exploring neural question generation for formal pragmatics: Data set and model evaluation

Learning Enhancement Using Question-Answer Generation for e-Book Using Contrastive Fine-Tuned T5

Conditional Generation with a Question-Answering Blueprint

Contact Info

Product

Resources

About