“…Scene graphs [45], vmCAN [59], Graph-align [136], Know more say less [137], GCN-LSTM [15], SGAE [16], StructCap [138], GCH [139], GIN [140], Textual-GCNs [141], CSMN [31], CMMN [64], ReGAT [142], Out-of-the-box [143], Graph VQA [144], GERG [145], VKMN [146], MAN-VQA [147], DMN+ [148], MSCQA [116], SCH-GAN [82], CBT [149], SCST [17], CAVP [18], SR-PL [19], SMem-VQA [150], ODA [151], AOA [152], Up-Down [32], Attention-aware [79], BSSAN [62], CRAN [153], CBP [36], SOT [5], PAGNet [34], MirrorGAN [154], DAI [155], T2I2T [11], CCGAN [13], C4Synth [156], Cycle-Attn+ [157], Coupled CycleGAN [158], TCCM [159], CycleMatch [160], VQA-Rephrasings [161], iQAN [162], MLAN...…”