“…The classic Marr-Albus-Ito model assumes a feedforward architecture in which errors are reduced through changes in the synaptic inputs to Purkinje cells, the sole output neurons of the cerebellar cortex ( Figure 1C ; Marr, 1969 ; Albus, 1971 ; Ito and Kano, 1982 ). This is consistent with a large number of studies suggesting that long-term depression (LTD) occurs at the excitatory parallel fiber synapses onto Purkinje cells in response to error signals carried by climbing fiber inputs, effectively implementing reinforcement learning through error-driven plasticity (‘parallel fiber-Purkinje cell LTD’; Coesmans et al, 2004 ; Gilbert and Thach, 1977 ; Ito and Kano, 1982 ; Kimpo et al, 2014 ; Medina and Lisberger, 2008 ; Sakurai, 1987 ; Silva et al, 2023 ; Yang and Lisberger, 2013 ; Yang and Lisberger, 2014 , but see Schonewille et al, 2011 ). In contrast, later experimental observations raised the possibility that the learning-related changes in Purkinje cell firing could instead be due to feedback of changes occurring outside of the cerebellar cortex (‘Miles-Lisberger model’, Figure 1D ; Hirata and Highstein, 2001 ; Lisberger, 1994a ; Lisberger et al, 1994c ; Lisberger et al, 1994b ; Miles and Lisberger, 1981 ).…”