2022
DOI: 10.48550/arxiv.2202.08520
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Abstract: Mastering is an essential step in music production, but it is also a challenging task that has to go through the hands of experienced audio engineers, where they adjust tone, space, and volume of a song. Remastering follows the same technical process, in which the context lies in mastering a song for the times. As these tasks have high entry barriers, we aim to lower the barriers by proposing an endto-end music remastering system that transforms the mastering style of input audio to that of the target. The sys… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…Furthermore, musical feature extractors utilizing advanced deep learning techniques have also been proposed. [36] and [21] use the idea of contrastive learning, and [11] use the idea of multi-scale VQ-VAE to extract low-level features of music. Accordingly, content-based music recommendations adopting these features are being suggested, demonstrating their practical applicability [10].…”
Section: Content-based Music Recommendationmentioning
confidence: 99%
See 3 more Smart Citations
“…Furthermore, musical feature extractors utilizing advanced deep learning techniques have also been proposed. [36] and [21] use the idea of contrastive learning, and [11] use the idea of multi-scale VQ-VAE to extract low-level features of music. Accordingly, content-based music recommendations adopting these features are being suggested, demonstrating their practical applicability [10].…”
Section: Content-based Music Recommendationmentioning
confidence: 99%
“…Considering that our work attempts to relate the contents with user feedback, we use low-level features following [37]. As representation learning has been actively studied in recent years, [11,21,36] proposed models trained in a self-supervised manner that produce novel music representations. There are also front-end models used in automatic music tagging [8,31], but they are models trained to classify music into a limited, discrete range of descriptions.…”
Section: Feature Extractionmentioning
confidence: 99%
See 2 more Smart Citations