SemEval-2022 Task 9: R2VQ – Competence-based Multimodal Question Answering

Tu, Jingxuan; Holderness, Eben; Maru, Marco; Conia, Simone; Rim, Kyeongmin; Lynch, Kelley; Brutti, Richard; Navigli, Roberto; Pustejovsky, James

doi:10.18653/v1/2022.semeval-1.176

Cited by 3 publications

(4 citation statements)

References 24 publications


“…Verbs from the analyzed part are collected using either SRL (more specifically, tokens labeled as B-V), or a CRL and SRL combination, namely finding B-EVENT (CRL) with corresponding SRL (I-V or D-V). A detailed description of the annotation system is presented in Tu et al (2022).…”

Section: Intent Identification (mentioning)

confidence: 99%


Samsung Research Poland (SRPOL) at SemEval-2022 Task 9: Hybrid Question Answering Using Semantic Roles

Dryjanski, Załęska, Kuźma, et al. 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)


In this work we present an overview of our winning system for the R2VQ - Competence-based Multimodal Question Answering task, which achieved a final exact match score of 92.53%. The task is structured as question-answer pairs that query how well a system is capable of competence-based comprehension of recipes. We propose a hybrid of a rule-based system, a Question Answering Transformer, and a neural classifier for N/A answer recognition. The rule-based system focuses on intent identification, data extraction and response generation.


“…The goal of the task was to develop a system applying existing knowledge to new situations, demonstrating a kind of understanding of a real-world domain. The competition presents a QA challenge requiring linguistic and cognitive competencies that humans have while speaking and reasoning (Tu et al, 2022).…”

Section: Introduction (mentioning)

confidence: 99%

Samsung Research Poland (SRPOL) at SemEval-2022 Task 9: Hybrid Question Answering Using Semantic Roles

Dryjanski, Załęska, Kuźma, et al. 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)


“…The R2VQ (Tu et al, 2022) task proposed the use of multimodal models to leverage both text and image for QA in the context of recipes. The R2VQ task adopts the definition of 'Question Family' from the CLEVR dataset (Johnson et al, 2017), where each type of question-answer pair comes from a template identified by task organisers.…”

Section: Task and Data (mentioning)

confidence: 99%

HIT&QMUL at SemEval-2022 Task 9: Label-Enclosed Generative Question Answering (LEG-QA)

Zhai, Feng, Zubiaga, et al. 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)


This paper presents the second-place system for the R2VQ: competence-based multimodal question answering shared task. The task consisted of building question answering systems that could process procedural recipes involving both text and image, enriched with semantic and cooking roles. We tackled the task by using a text-to-text generative model based on the transformer architecture, with the aim of generalising across different question types. Our proposed architecture enriches input texts with semantic and cooking role labels through what we call Label-Enclosed Generative Question Answering (LEG-QA). Our model achieves a score of 91.3, a significant improvement over the baseline (65.34) and close to the top-ranked system (92.5). After describing the submitted system, we analyse the impact of the different components of LEG-QA and perform an error analysis.


“…In this paper, we discuss an approach to the Question Answering (QA) task for SemEval-2022 Task 9 (Tu et al, 2022). This task is structured as question-answer pairs, querying how well a system understands the semantics of recipes derived from a collection of English cooking recipes and videos, which involve rich semantic annotation and aligned text-video objects.…”

Section: Introduction (mentioning)

confidence: 99%

PINGAN_AI at SemEval-2022 Task 9: Recipe knowledge enhanced model applied in Competence-based Multimodal Question Answering

Ruan, Hou, Jiang 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)


This paper describes our system used in SemEval-2022 Task 9: R2VQ - Competence-based Multimodal Question Answering. We propose a knowledge-enhanced model for predicting answers in the QA task; the model uses BERT as its backbone. We adopted two knowledge-enhanced methods in this model: the knowledge auxiliary text method and the knowledge embedding method. We also designed an answer extraction task pipeline, which contains an extraction-based model, an automatic keyword labeling module, and an answer generation module. Our system ranked 3rd in Task 9 and achieved an exact match score of 78.21 and a word-level F1 score of 82.62.
