2024
DOI: 10.3390/jimaging10030056
|View full text |Cite
|
Sign up to set email alerts
|

Collaborative Modality Fusion for Mitigating Language Bias in Visual Question Answering

Qiwen Lu,
Shengbo Chen,
Xiaoke Zhu

Abstract: Language bias stands as a noteworthy concern in visual question answering (VQA), wherein models tend to rely on spurious correlations between questions and answers for prediction. This prevents the models from effectively generalizing, leading to a decrease in performance. In order to address this bias, we propose a novel modality fusion collaborative de-biasing algorithm (CoD). In our approach, bias is considered as the model’s neglect of information from a particular modality during prediction. We employ a c… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 25 publications
0
0
0
Order By: Relevance