Learning to cooperate: Emergent communication in multi-agent navigation

Ivana Kajić, Centre for Theoretical Neuroscience, University of Waterloo, Waterloo, Ontario, Canada
Eser Aygün, DeepMind, Montréal, Quebec, Canada
Doina Precup, DeepMind, Montréal, Quebec, Canada

Abstract

Emergent communication in artificial agents has been studied to understand language evolution, as well as to develop artificial systems that learn to communicate with humans. We show that agents performing a cooperative navigation task in various gridworld environments learn an interpretable communication protocol that enables them to solve the task efficiently and, in many cases, optimally. An analysis of the agents' policies reveals that emergent signals spatially cluster the state space, with signals referring to specific locations and spatial directions such as "left", "up", or "upper left room". Using populations of agents, we show that the emergent protocol has basic compositional structure, thus exhibiting a core property of natural language.
