2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698) 2003
DOI: 10.1109/icme.2003.1221722
|View full text |Cite
|
Sign up to set email alerts
|

Conventional and periodic N-grams in the transcription of drum sequences

Abstract: In this paper, we describe a system for transcribing polyphonic drum sequences from an acoustic signal to a symbolic representation. Low-level signal analysis is done with an acoustic model consisting of a Gaussian mixture model and a support vector machine. For higher-level modelling, periodic N-grams are proposed to construct a "language model" for music, based on the repetitive nature of musical structure. Also, a technique for estimating relatively long N-grams is introduced. The performance of N-grams in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0
1

Year Published

2004
2004
2006
2006

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(27 citation statements)
references
References 9 publications
0
26
0
1
Order By: Relevance
“…As a consequence, drum loop signals or drum tracks often exhibit a temporal structure. Two concurrent studies have exploited such a structure by means of a sequence model, or "language model" by analogy with large vocabulary speech recognition systems ( [Paulus and Klapuri, 2003] for drum sequences transcription, or [Gillet and Richard, 2003] for the transcription of tabla signals).…”
Section: Introductionmentioning
confidence: 99%
“…As a consequence, drum loop signals or drum tracks often exhibit a temporal structure. Two concurrent studies have exploited such a structure by means of a sequence model, or "language model" by analogy with large vocabulary speech recognition systems ( [Paulus and Klapuri, 2003] for drum sequences transcription, or [Gillet and Richard, 2003] for the transcription of tabla signals).…”
Section: Introductionmentioning
confidence: 99%
“…Herrera et al [11] compared conventional classifiers in experiments on identifying individual drum sounds. To transcribe drum sounds in drums-only audio signals, the use of N-grams [14], probabilistic models [15], and HMM&SVM [6] have been proposed. To identify drum sounds extracted from polyphonic audio signals, Sandvolt et al [17] proposed a feature-model adaptation method that is robust to the distortion of features since feature distortion caused by other sounds is a major problem.…”
Section: Related Workmentioning
confidence: 99%
“…As noted in [9], this may lead to the occurrence of very unlikely combinations in the transcription. The co-occurrence information is modelled implicitly with the combination modelling.…”
Section: Using the Modelsmentioning
confidence: 93%
“…However, in practice a small portion of these combinations contribute the majority of all combination occurrences [9,3]. This observation can be used to choose a subset of combinations to be modelled.…”
Section: Combination Modellingmentioning
confidence: 99%