“…This is especially relevant as the reliability of the classical ACE has recently been called into question (Papesh, 2015), particularly since a large multi-lab collaboration failed to replicate it (Morey et al., in press). However, this debate on the ACE has not yet considered word-level effects, which have been reliably observed across many different studies (Dudschig, de la Vega, De Filippis, & Kaup, 2014; Dudschig, Lachmair, de la Vega, De Filippis, & Kaup, 2012; Lachmair et al., 2011; Öttl et al., 2017; Thornton et al., 2013; Vogt, Kaup, & Dudschig, 2019; see also the pilot study in Günther et al., 2018). Where this word-level effect was not observed, its absence can be attributed either to a lack of salience of the vertical dimension in both the stimulus and response set or, as in the studies presented here, to the specific word material (novel word labels for non-experienced referents; compare Günther et al., 2018).…”