“…For example, a glass may be used for drinking water, under an implicit assumption that the water is at normal temperature, but may not be if the glass is shattered. From the cognitive perspective, understanding the affordance of objects, or simply preconditions of actions (Qasemi et al, 2022a), is part of the commonsense knowledge that constitutes what distinguishes humans from a machine to make inference (Lenat, 1998). From an applications perspective, it also has huge implications such as robotics (Ahn et al, 2022), trans-Figure 1: Preconditioned Visual Language Inference (PVLI) and Preconditioned Visual Language Reasoning (PVLR) tasks.…”