“…Several works have enhanced the final feature representation by combining global and local features of pedestrians [13,101,110,120,136,142,147]. Because GANs perform well at both image generation and feature learning, they are widely used for person Re-ID tasks [17,22,29,40,41,72,119,153,154,157,159]. To compensate for the limited information in single-frame images, some researchers have exploited the complementary spatial and temporal cues of video sequences to fuse more of the information they contain [19,26,36,62,129,132].…”