Patch-based training for 360-degree images allows to significantly reduce the complexity compared to multichannel models while maintaining good performances. Differently from multichannel models where multi neural networks are trained in parallel to predict the score of the whole 360-degree image, a pooling stage is required to map local qualities to the global one. This step is often neglected by using a simple arithmetic mean, which does not account for (i) the non-uniformity distribution of quality and (ii) the variability among local qualities. In this paper, we analyze several pooling strategies, including basic statistic methods and adaptive pooling ones. Additionally, we propose a pooling strategy based on scene exploration behavior relying on visual scan-path. The performance analysis showed the benefit of using adaptive pooling over arithmetic mean, as well as the incorporation of perceptual properties during the pooling stage. Besides, the comparison with state-of-the-art multichannel models asserts the effectiveness of patch-based training compared to multichannel models.