“…We developed a taxonomy of nine reasons why answers may differ, summarized in Table 1. Six of the nine reasons are inspired by the crowdsourcing literature: INV [32], DFF [44], AMB [24,26,43], SBJ [33,43,9], SYN [32], and SPM [41,42,14,15]. Two of the reasons are inspired by prior visual question answering work [20]: LQI and IVE.…”