“…In other words, they can only memorize token patterns from existing corpus rather than understand the rationale behind the language context, thus would fail to perform deductive or abductive reasoning tasks. Similar observation has been investigated by other reasoning tasks, including IQ test (Zhang et al, 2019a(Zhang et al, ,b, 2021b, number sense , causal reasoning (Edmonds et al, 2018(Edmonds et al, , 2019bZhang et al, 2021a), and more generic generalization tasks (Lake et al, 2015;Xie et al, 2021;Li et al, 2021;.…”