The huge number of cores existing in current Graphics Processor Units (GPUs) provides these devices with computing capabilities that can be exploited by distributed applications. In particular, these capabilites have been used in crowd simulations for enhancing the crowd rendering, and even for simulating continuum crowds. However, GPUs have not been used for simulating large crowds of complex agents, since these simulations require distributed architectures that can support huge amounts of agents. In this paper, we propose a GPU-based multi-agent system for crowd simulation. Concretely, we propose the use of an on-board GPU to implement some of the tasks that a distributed server for crowd simulations should perform. The huge number of cores in the GPU is used to simultaneously validate movement requests from different agents, greatly reducing the server response time. Since this task represents the critical data path, the use of this hardware significantly increases the parallelism achieved with respect to the implementation of the same distributed server on a CPU. An application example shows that the system can support agents with complex navigational behaviors.