IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005.
DOI: 10.1109/aspaa.2005.1540234
|View full text |Cite
|
Sign up to set email alerts
|

The thirteen colors of timbre

Abstract: We describe a perceptual space for timbre, define an objective metric that takes into account perceptual orthogonality and measure the quality of timbre interpolation. We discuss three timbre representations and measure perceptual judgments. We determine that a timbre space based on Mel-frequency cepstral coefficients (MFCC) is a good model for a perceptual timbre space.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
20
0
4

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(25 citation statements)
references
References 5 publications
1
20
0
4
Order By: Relevance
“…This scale, called mel, is the basis for the mel-frequency cepstral coefficients (MFCC). MFCCs are popular features in various application domains, particularly speech and speaker recognition [43] as well as musical instruments classification [36]. Equation 12 defines the scale conversion from frequency (f ) to mel (m).…”
Section: Feature Extraction In Cepstral Representationsmentioning
confidence: 99%
“…This scale, called mel, is the basis for the mel-frequency cepstral coefficients (MFCC). MFCCs are popular features in various application domains, particularly speech and speaker recognition [43] as well as musical instruments classification [36]. Equation 12 defines the scale conversion from frequency (f ) to mel (m).…”
Section: Feature Extraction In Cepstral Representationsmentioning
confidence: 99%
“…Since there are literally hundreds of signal processing features available in the literature, we performed several experiments to investigate which features would provide the best results for our domain. We experimented with features used in similar applications such as spoken digit recognition [14], music instrument recognition [15] and recognition of species of animals by their calls [16]. Due to lack of space, we will reserve these results for a future publication.…”
Section: Laser Insect Sensormentioning
confidence: 99%
“…Mel-frequency cepstral coefficients (MFCC) are often chosen as a metric for spectral envelope perception because of their linearity, orthogonality, and multidimensionality (Terasawa et al, 2012). They were also applied in a study by Rump et al (2007), which aimed at the improvement of accuracy of MFCCbased genre classification by applying the HarmonicPercussion Signal Separation (HPSS) algorithm to the music signal, and then calculating the MFCCs on the separated signals.…”
Section: Drum Separation Algorithmmentioning
confidence: 99%