In this paper, we explore interaction history as a particular source of pressure for achieving emergent
compositional communication in multi-agent systems. We propose a training regime implementing template transfer, the idea of
carrying over learned biases across contexts. In the presented method, a sender-receiver dyad is first trained with a disentangled
pair of objectives, and then the receiver is transferred to train a new sender with a standard objective. Unlike other methods
(e.g. the obverter algorithm), the template transfer approach does not require imposing inductive biases on the architecture of
the agents. We experimentally show the emergence of compositional communication using topographical similarity, zero-shot
generalization and context-independence as evaluation metrics. The presented approach is connected to an important line of work in
semiotics and developmental psycholinguistics: it supports a conjecture that compositional communication is scaffolded on simpler
communication protocols.
Complex reasoning problems contain states that vary in the computational cost required to determine a good action plan. Taking advantage of this property, we propose Adaptive Subgoal Search (AdaSubS), a search method that adaptively adjusts the planning horizon. To this end, AdaSubS generates diverse sets of subgoals at different distances. A verification mechanism is employed to filter out unreachable subgoals swiftly and thus allowing to focus on feasible further subgoals. In this way, AdaSubS benefits from the efficiency of planning with longer subgoals and the fine control with the shorter ones. We show that AdaSubS significantly surpasses hierarchical planning algorithms on three complex reasoning tasks: Sokoban, the Rubik's Cube, and inequality proving benchmark INT, setting new state-of-the-art on INT. * equal contribution, MZ and MT implemented most of the experiments, KC led the project
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.