PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Liang, Hongru; Lei, Wenqiang; Chan, Paul; Yang, Zhenglu; Sun, Maosong; Chua, Tat-Seng

doi:10.1145/3394171.3414032

Cited by 36 publications

(32 citation statements)

References 19 publications

“…Additionally, it would be interesting to complete on some of the systems studied in this paper that are concerned with automating one single musical task to turn them into full composers. For example, rhythm patterns learned in [37] can be embedded within the process of generating further complete music compositions. In this survey we already highlighted the merge of different CI techniques together such as rule-based with GA and AIS, and such as CBR with Markov chains.…”

Section: Discussion (mentioning)

confidence: 99%

Applications of Computational Intelligence in Computer Music Composition

Siphocly, Salem, El-Horabty. 2021. IJICIS

Engaging computers in composing musical pieces is a challenging and trending field of research. The musical tasks that can be performed or aided by computers' computational powers, are numerous. This paper is concerned with applications of computational intelligence in music composition. Its main objective is to survey various computational intelligence techniques for performing miscellaneous music composition tasks. To achieve this objective, we first define each music composition task, then we discuss the recent applications of each, and the techniques adopted in them. We also highlight the most suitable techniques for performing each task. Our study shows that the most suitable techniques for human composers imitative systems are case-based reasoning and artificial neural networks. It is also shown that Markov models are more suitable for predicting musical notes based on the given previous notes. Genetic algorithms excel in chord progressions generation. Deep neural networks are clever at capturing temporal information of a musical piece. The state-of-the-art generative adversarial networks produce music as close as possible to real compositions. At the end of this study, we shed the light on many future research directions in the field of computer music composition.

“…The extracted patterns then serve as the basis for chromosome representation. Hongru Liang et al [37] developed a rhythm learning model based on ANNs. Table. 1 summarizes the frequently used CI techniques for automating each music composition task.…”

Section: Rhythm (mentioning)

confidence: 99%

“…Huang et al (2016); Madjiheurem et al (2016) regard chords as words in NLP and learn chords representations using the word2vec model. Herremans and Chuan (2017); Chuan et al (2020); Liang et al (2020) divide music pieces into non-overlapping music slices with a fixed duration and train the embeddings for each slice. Hirai and Sawada (2019) cluster musical notes into groups and regard such groups as words for representation learning.…”

Section: Symbolic Music Understanding (mentioning)

confidence: 99%

“…Similar to natural language, music is usually represented in symbolic data format (e.g., MIDI) (Jackendoff, 2009; McMullen and Saffran, 2004) with sequential tokens, and some methods (Mikolov et al, 2013a,b) from NLP can be adopted for symbolic music understanding. Since the labeled training data for each music understanding task is usually scarce, previous works (Liang et al, 2020; Chuan et al, 2020) leverage unlabeled music data to learn music token embeddings, similar to word embeddings in natural language tasks. Unfortunately, due to their shallow structures and limited unlabeled data, such embedding-based approaches have limited capability to learn powerful music representations.…”

Section: Introduction (mentioning)

confidence: 99%

MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training

Zeng, Tan, Wang et al. 2021

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Symbolic music understanding, which refers to the understanding of music from the symbolic data (e.g., MIDI format, but not audio), covers many music applications such as genre classification, emotion classification, and music pieces matching. While good music representations are beneficial for these applications, the lack of training data hinders representation learning. Inspired by the success of pre-training models in natural language processing, in this paper, we develop MusicBERT, a large-scale pre-trained model for music understanding. To this end, we construct a large-scale symbolic music corpus that contains more than 1 million music songs. Since symbolic music contains more structural (e.g., bar, position) and diverse information (e.g., tempo, instrument, and pitch), simply adopting the pre-training techniques from NLP to symbolic music only brings marginal gains. Therefore, we design several mechanisms, including OctupleMIDI encoding and bar-level masking strategy, to enhance pre-training with symbolic music data. Experiments demonstrate the advantages of MusicBERT on four music understanding tasks, including melody completion, accompaniment suggestion, genre classification, and style classification. Ablation studies also verify the effectiveness of our designs of OctupleMIDI encoding and bar-level masking strategy in MusicBERT.

“…Robust Training: Robust training has shown to be effective to improve the robustness of the models in computer vision (Szegedy et al, 2013). In Natural Language Processing, it involves augmenting the training data with carefully crafted noisy examples: semantically equivalent word substitutions (Alzantot et al, 2018), paraphrasing (Iyyer et al, 2018; Ribeiro et al, 2018), character-level noise (Ebrahimi et al, 2018b; Tan et al, 2020a,b), or perturbations at embedding space (Miyato et al, 2016; Liang et al, 2020). Inspired by Lei et al (2017) that nicely captures the semantic interactions in discourse relation, we regard noise as a disruptor to break semantic interactions and propose our CER approach to mitigate this phenomenon.…”

Section: Related Work (mentioning)

confidence: 99%

Addressing the Vulnerability of NMT in Input Perturbations

Xu, Aw, Ding et al. 2021

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua

Neural Machine Translation (NMT) has achieved significant breakthrough in performance but is known to suffer vulnerability to input perturbations. As real input noise is difficult to predict during training, robustness is a big issue for system deployment. In this paper, we improve the robustness of NMT models by reducing the effect of noisy words through a Context-Enhanced Reconstruction (CER) approach. CER trains the model to resist noise in two steps: (1) perturbation step that breaks the naturalness of input sequence with made-up words; (2) reconstruction step that defends the noise propagation by generating better and more robust contextual representation. Experimental results on Chinese-English (ZH-EN) and French-English (FR-EN) translation tasks demonstrate robustness improvement on both news and social media text. Further finetuning experiments on social media text show our approach can converge at a higher position and provide a better adaptation.
