“…Ruis, Burghouts, & Bucur, 2021), natural language processing Baroni, 2020;Keysers et al, 2020;Kim & Linzen, 2020), and more generally (Nam & McClelland, 2021). Two fundamentally different approaches are taken by the literature; one utilizes additional data while making few changes to the conventional setup and architecture (Furrer, van Zee, Scales, & Schärli, 2020), while the other utilizes additional inductive biases that aim to support systematic generalization (Russin et al, 2019;Lake, 2019;Andreas, 2020;Nye et al, 2020;Gordon et al, 2020;Bogin et al, 2021;Chaabouni, Dessì, & Kharitonov, 2021). In this work we apply both approaches, the former through data augmentation, and the latter through high-level modularity.…”