Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval 2019
DOI: 10.1145/3331184.3331337
|View full text |Cite
|
Sign up to set email alerts
|

Improving Collaborative Metric Learning with Efficient Negative Sampling

Abstract: Distance metric learning based on triplet loss has been applied with success in a wide range of applications such as face recognition, image retrieval, speaker change detection and recently recommendation with the Collaborative Metric Learning (CML) model. However, as we show in this article, CML requires large batches to work reasonably well because of a too simplistic uniform negative sampling strategy for selecting triplets. Due to memory limitations, this makes it difficult to scale in high-dimensional sce… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(29 citation statements)
references
References 19 publications
0
28
0
Order By: Relevance
“…One limitation of CONVFIT, especially prominent in Full scenarios, is its quadratic time complexity. Future work will look into effective sampling strategies and adaptations towards more sampleefficient and quicker fine-tuning (Tran et al, 2019;Tian et al, 2020;O'Neill and Bollegala, 2021).…”
Section: Further Discussionmentioning
confidence: 99%
“…One limitation of CONVFIT, especially prominent in Full scenarios, is its quadratic time complexity. Future work will look into effective sampling strategies and adaptations towards more sampleefficient and quicker fine-tuning (Tran et al, 2019;Tian et al, 2020;O'Neill and Bollegala, 2021).…”
Section: Further Discussionmentioning
confidence: 99%
“…We only keep the songs whose Essentia (and consequently AVD) features can be calculated, which corresponds to a total of 204, 316 songs [21]. The playcount data is binarized by retaining values of five or higher as implicit feedback [31]. As in [8,15], in order to keep the computational burden low, we retain the top songs and users (sorted by playcounts) and we remove inactive users and items (that is, we only keep users who listened to at least 20 songs, and songs which have been listened to by at least 50 users).…”
Section: Protocolmentioning
confidence: 99%
“…Different from the RME which jointly decomposes the user-item rating matrix and the user-user co-occurrence matrix, we utilize the users' co-occurrence patterns to distinguish sets of users with extremely similar or dissimilar consumption behaviors and combine social information to change their relative positions in the metric space. It is worth mentioning that negative sampling is not usually used when calculating the similarity of users' consumption behavior, but several works [20,21] have studied the implications of negative sampling as well as various methods to improve the quality of recommendation. Contrary to previous studies [8,9] which only used the PMI formula to capture the positive similarity between users, we consider negative sampling of user similarity and find the list of users with extremely dissimilar consumption behavior for each user.…”
Section: Motivationmentioning
confidence: 99%