“…In the existing works, majority of efforts have been cast into extracting robust and discriminative visual representation. It has been verified that the local features, i.e., color or oriented gradient histogram [2], [3], [4], [5], [6] are effective for person ReID, and combining multiple types of features, i.e., color, texture, and spatial structure, is useful to find more informative matchings [7], [8], [9], [10], [11], [12], [13], [14], [15], [16]. On the other hand, supervised metric R. Guo learning methods-which learn a discriminative distance metric (or equivalently a low-dimensional subspace), in which the samples of same person are closer, could help the task of finding informative matchings [17], [18], [19], [20], [21], [22], [23].…”