Incorporating Scalability in Unsupervised Spatio-Temporal Feature Learning

Shrivastava, Paul; Roy, Sourya; Roy-Chowdhury, Amit K.

doi:10.1109/icassp.2018.8461758

Cited by 2 publications

(4 citation statements)

References 26 publications


“…We investigate the contributions of the temporal losses to the re-identification [Fig. 4 legend: TCPL (L), EUG [30], One-Shot Progressive [29]]…”

Section: Methods (mentioning)

confidence: 99%

“…(1) using the unlabeled data efficiently is important, [figure legend: TCPL, EUG [30], One-Shot Progressive [29]] performance. In order to do that, we performed experiments with different values of λ (higher value indicates larger weight on the temporal losses) and present the results on the DukeMTMC-VideoReID dataset in Fig.…”

Section: Methods (mentioning)

confidence: 99%

“…Intra-sequence temporal consistency. The intra-sequence temporal consistency loss is based on the idea of video temporal coherence [29,25,23]. While the previous works focus on learning the temporal order by considering individual frames, we use consistency as a tool for the learnt features to implicitly ignore background nuances and focus on the actual person attributes.…”

Section: Temporal Coherence As Self-supervision (mentioning)

confidence: 99%

“…We propose using temporal coherence [29,25,23] as a form of self-supervision to maximally utilize the unlabeled data and learn discriminative person specific representations. Temporal coherence is motivated by the fact that features corresponding to a person in a tracklet should be focused on the discriminative aspects related to the person, such as clothing and gait, and ignore background nuances such as illumination and occlusion (see Fig.…”

Section: Introduction (mentioning)

confidence: 99%


Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification

Raychaudhuri, Roy-Chowdhury

2020

Preprint

Self Cite


While supervised techniques in re-identification are extremely effective, the need for large amounts of annotations makes them impractical for large camera networks. One-shot re-identification, which uses a single labeled tracklet for each identity along with a pool of unlabeled tracklets, is a potential candidate for reducing this labeling effort. Current one-shot re-identification methods function by modeling the inter-relationships among the labeled and the unlabeled data, but fail to fully exploit the relationships that exist within the pool of unlabeled data itself. In this paper, we propose a new framework named Temporal Consistency Progressive Learning, which uses temporal coherence as a novel self-supervised auxiliary task in the one-shot learning paradigm to capture such relationships among the unlabeled tracklets. By optimizing two new losses, which enforce consistency on a local and a global scale, our framework can learn richer and more discriminative representations. Extensive experiments on two challenging video re-identification datasets, MARS and DukeMTMC-VideoReID, demonstrate that our proposed method is able to estimate the true labels of the unlabeled data more accurately by up to 8%, and obtain significantly better re-identification performance compared to the existing state-of-the-art techniques.

