Neurons in visual and vestibular information integration areas of macaque brain such as medial superior temporal (MSTd) and ventral intraparietal (VIP) have been classified into congruent neurons and opposite neurons, which prefer congruent inputs and opposite inputs from the two sensory modalities, respectively. In this work, we propose a mechanistic spiking neural model that can account for the emergence of congruent and opposite neurons and their interactions in a neural circuit for multi-sensory integration. The spiking neural circuit model is adopted from an established model for the circuits of the primary visual cortex with little changes in parameters. The network can learn, based on the basic Hebbian learning principle, the correct topological organization and behaviors of the congruent and opposite neurons that have been proposed to play a role in multi-sensory integration. This work explore the constraints and the conditions that lead to the development of a proposed neural circuit for cue integration. It also demonstrates that such neural circuit might indeed be a canonical circuit shared by computations in many cortical areas.