“…Compared with unsupervised methods [4,19], supervised methods [5,14,17,18,20] can yield better performance with the support of label supervision. With the rapid development of deep neural network, deep hashing methods [1,2,7,11,12,16,21] have demonstrated superior performance over non-deep hashing methods and achieved state-of-the-art results on public benchmarks. However, among mainstream deep hashing frameworks, humanannotated labels purely supervise the distribution alignment of hash code embedding, yet fail to trigger context-aware visual representation learning, let alone optimal binary codes generation.…”