“…The pose attention-guided appearance network and the pose attention-guided generation network progressively model the appearance and shape of a person to synthesise a person image with the target pose, while keeping the appearance and identity constant. The discriminative re-ID module is trained with the quartet loss function [18] to boost re-ID performance. As shown in Figure 2, the condition image, I_c, and the condition pose, P_c, are first fed into the appearance encoder, E_A, and pose encoder, E_p, to generate the appearance map, f^I_o, and the pose map, f^P_o.…”
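
The first step described in the excerpt (condition image and condition pose passing through two separate encoders) can be illustrated with a minimal data-flow sketch. Everything concrete here is an assumption made for illustration and is not taken from the quoted paper: the encoders are stood in for by plain 2x2 average pooling instead of learned layers, the pose is assumed to be a stack of 18 keypoint heatmaps, and all function names, stage counts, and tensor sizes are hypothetical.

```python
from typing import Callable, List

# A feature map is stored as nested lists: channels x height x width.
FeatureMap = List[List[List[float]]]


def avg_pool2x2(x: FeatureMap) -> FeatureMap:
    """Toy stand-in for one learned encoder stage: 2x2 average pooling per channel."""
    out = []
    for ch in x:
        h, w = len(ch), len(ch[0])
        pooled = [
            [
                (ch[i][j] + ch[i][j + 1] + ch[i + 1][j] + ch[i + 1][j + 1]) / 4.0
                for j in range(0, w - 1, 2)
            ]
            for i in range(0, h - 1, 2)
        ]
        out.append(pooled)
    return out


def make_encoder(num_stages: int) -> Callable[[FeatureMap], FeatureMap]:
    """Build a hypothetical encoder that applies `num_stages` pooling stages."""
    def encode(x: FeatureMap) -> FeatureMap:
        for _ in range(num_stages):
            x = avg_pool2x2(x)
        return x
    return encode


def constant_map(channels: int, height: int, width: int, value: float) -> FeatureMap:
    """Create a placeholder input filled with a single value."""
    return [[[value] * width for _ in range(height)] for _ in range(channels)]


def shape(x: FeatureMap) -> tuple:
    """Return (channels, height, width) of a feature map."""
    return (len(x), len(x[0]), len(x[0][0]))


# Two separate encoders, mirroring E_A and E_p in the text (stage count assumed).
appearance_encoder = make_encoder(num_stages=2)  # plays the role of E_A
pose_encoder = make_encoder(num_stages=2)        # plays the role of E_p

# Placeholder inputs: an RGB condition image I_c and an assumed
# 18-channel keypoint-heatmap condition pose P_c, both 16 x 8.
condition_image = constant_map(3, 16, 8, 0.5)
condition_pose = constant_map(18, 16, 8, 0.0)

appearance_map = appearance_encoder(condition_image)  # role of f^I_o
pose_map = pose_encoder(condition_pose)               # role of f^P_o

print(shape(appearance_map), shape(pose_map))
```

In this sketch the two outputs share the same spatial size, which is a choice of the toy setup that makes it easy to pass them on together to later stages. A real implementation would replace the pooling stages with trained convolutional layers.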