“…They take advantage of deep neural networks [17,27,31,11] to construct a mapping from the data space to the embedding space so that the Euclidean distance in the embedding space can reflect the actual semantic distance between data points, i.e., a relatively large distance between inter-class samples and a relatively small distance between intra-class samples. Recently a variety of deep metric learning methods have been proposed and have demonstrated strong effectiveness in various tasks, such as image retrieval [30,23,19,5], person re-identification [26,37,48,2], and geo-localization [35,14,34]. Figure 1.…”