Self-attention visualization GeneralizationFigure 1: Overview of the proposed approach. We use the same neural controller (left picture) inside each voxel, with shared parameters. The middle picture is a biped with the attention matrices of the different voxels. Each controller uses self-attention to compute importance scores (𝑨) among the inputs sensed by its voxel. We also find evolved controllers to generalize to unseen morphologies (right picture; color represents the ratio between the voxel current area and its rest area: red stands for contraction, green for expansion, yellow for no change).