Most benchmarks designed for question answering over knowledge bases (KBQA) operate under the i.i.d. assumption, where the same schema items are encountered during inference as those observed during training. Recently, the GrailQA dataset was established to evaluate the zero-shot generalization capabilities of KBQA models as a departure from the i.i.d. assumption. The reasonable performance of current KBQA systems on the GrailQA zero-shot split hints that the field might be moving towards more generalizable systems. In this work, we observe a bias in the GrailQA dataset towards simpler one- or two-hop questions, which results in an inaccurate assessment of this generalization ability. We propose GrailQA++, a challenging zero-shot KBQA test set that contains more questions relying on complex reasoning. We leverage the concept of graph isomorphisms to control the complexity of the questions and to ensure that our proposed test set has a fair distribution of simple and complex questions. Existing KBQA models suffer a substantial drop in performance on our new test set compared to the GrailQA zero-shot split. Our analysis reveals how isomorphisms can be used to understand the complementary strengths of different KBQA models and to provide deeper insight into model mispredictions. Overall, our paper highlights the limited generalizability of existing models and the need for more challenging benchmarks. Our dataset is available at https://github.com/sopankhosla/GrailQA-PlusPlus.
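To make the role of isomorphisms concrete, the following minimal sketch illustrates how query graphs could be grouped into structural isomorphism classes (complexity buckets); the graphs, relation names, and bucketing loop are hypothetical illustrations using networkx, not the exact procedure used to construct GrailQA++.

```python
# Illustrative sketch: grouping query graphs by structural isomorphism.
# The shapes and bucketing logic are hypothetical, not the GrailQA++ pipeline.
import networkx as nx

def query_graph(edges):
    """Build a directed query graph from (head, relation, tail) triples."""
    g = nx.DiGraph()
    for head, rel, tail in edges:
        g.add_edge(head, tail, relation=rel)
    return g

# A one-hop question and two two-hop questions (hypothetical logical forms).
one_hop = query_graph([("?x", "r1", "answer")])
two_hop = query_graph([("?x", "r1", "?y"), ("?y", "r2", "answer")])
two_hop_other = query_graph([("?a", "s1", "?b"), ("?b", "s2", "answer")])

# Structural isomorphism ignores the specific schema items and keeps
# only the shape of the reasoning pattern.
print(nx.is_isomorphic(one_hop, two_hop))        # False: different shapes
print(nx.is_isomorphic(two_hop, two_hop_other))  # True: same two-hop chain

def bucket_by_shape(graphs):
    """Group query graphs into isomorphism classes (complexity buckets)."""
    buckets = []
    for g in graphs:
        for bucket in buckets:
            if nx.is_isomorphic(g, bucket[0]):
                bucket.append(g)
                break
        else:
            buckets.append([g])
    return buckets

shapes = bucket_by_shape([one_hop, two_hop, two_hop_other])
print(len(shapes))  # 2 isomorphism classes -> 2 complexity buckets
```

Grouping questions by such isomorphism classes is one way to balance a test set across simple and complex reasoning patterns, independently of which particular schema items appear in each question.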