2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings
DOI: 10.1109/icassp.2006.1661257
|View full text |Cite
|
Sign up to set email alerts
|

Acoustic Modelling of Drum Sounds with Hidden Markov Models for Music Transcription

Abstract: This paper describes two methods for applying hidden Markov models (HMMs) to acoustic modelling of drum sound events for polyphonic music transcription. The proposed methods are instrumentwise binary modelling and modelling of instrument combinations. In the first, each target instrument is modelled with a "sound" model and all target instruments share a "silence" model. Each instrument is transcribed independently from the others. In the latter method, different instrument combinations are modelled, and an ad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 5 publications
0
2
0
Order By: Relevance
“…To get an accurate FFT, music samples are segmented into 23 ms frames [4]. When compared with other sample rates and segment size combinations, this gives the best performance [5].For windowing, Hamming window is used because the combination of Mel frequency and Hamming window gives improved results [6]. Thirteen MFCC coefficients are calculated for each window.…”
Section: ) Extracting Mfccmentioning
confidence: 99%
“…To get an accurate FFT, music samples are segmented into 23 ms frames [4]. When compared with other sample rates and segment size combinations, this gives the best performance [5].For windowing, Hamming window is used because the combination of Mel frequency and Hamming window gives improved results [6]. Thirteen MFCC coefficients are calculated for each window.…”
Section: ) Extracting Mfccmentioning
confidence: 99%
“…While this procedure has proved particularly successful on solo drum signals [5], [6], its application to polyphonic music [7]- [9] is more challenging, as most of the features used for classification are sensitive to the presence of background music. Efforts have been made lately by Paulus [10] to jointly perform the segmentation and the classification, as a single decoding process of a hidden Markov model. A second procedure consists in searching for occurrences of a reference temporal [11] or time-frequency [12] template within the music signal.…”
Section: Introductionmentioning
confidence: 99%