2019 International Conference on Content-Based Multimedia Indexing (CBMI) 2019
DOI: 10.1109/cbmi.2019.8877462
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Task Music Representation Learning from Multi-Label Embeddings

Abstract: This paper presents a novel approach to music representation learning. Triplet loss based networks have become popular for representation learning in various multimedia retrieval domains. Yet, one of the most crucial parts of this approach is the appropriate selection of triplets, which is indispensable, considering that the number of possible triplets grows cubically. We present an approach to harness multitag annotations for triplet selection, by using Latent Semantic Indexing to project the tags onto a high… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 18 publications
0
9
0
Order By: Relevance
“…Track-relatedness by Collection Membership: The first evaluation is based on common approaches to representation learning which use membership to a class [8], label [24] or identity [19,27] to select positive and negative examples for triplet based neural networks. As can be observed in Figure 5 this approach tends to learn representation which focus on collection related features and acoustic artifacts.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Track-relatedness by Collection Membership: The first evaluation is based on common approaches to representation learning which use membership to a class [8], label [24] or identity [19,27] to select positive and negative examples for triplet based neural networks. As can be observed in Figure 5 this approach tends to learn representation which focus on collection related features and acoustic artifacts.…”
Section: Resultsmentioning
confidence: 99%
“…A differentiated evaluation of the learned semantic context as provided in this paper was missing. Concerning Representation Learning the presented paper in mostly related to [24] in which we extended the Million Song Dataset (MSD) [2] with additional ground truth multi-label assignments for Moods, Styles and Themes. Further, we extended the single-label Genre labels provided in [26] to multilabel assignments.…”
Section: Representation Learning (Rl)mentioning
confidence: 99%
See 3 more Smart Citations