2017
DOI: 10.3389/fpsyg.2017.00639
|View full text |Cite
|
Sign up to set email alerts
|

Modeling Timbre Similarity of Short Music Clips

Abstract: There is evidence from a number of recent studies that most listeners are able to extract information related to song identity, emotion, or genre from music excerpts with durations in the range of tenths of seconds. Because of these very short durations, timbre as a multifaceted auditory attribute appears as a plausible candidate for the type of features that listeners make use of when processing short music excerpts. However, the importance of timbre in listening tasks that involve short excerpts has not yet … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…Müller (2015) explained this by pointing to the potential of MFCCs to capture aspects like instrumentation and timbre. In line with this view, Müllensiefen and Siedenburg (2017) used MFCCs as external criteria for the validation of (dis)similarity ratings in music psychology.…”
Section: Methodsmentioning
confidence: 99%
“…Müller (2015) explained this by pointing to the potential of MFCCs to capture aspects like instrumentation and timbre. In line with this view, Müllensiefen and Siedenburg (2017) used MFCCs as external criteria for the validation of (dis)similarity ratings in music psychology.…”
Section: Methodsmentioning
confidence: 99%
“…Audio features have been widely used in timbre research for explaining quantitatively the dimensions of timbre spaces (Grey and Gordon, 1978 ; Iverson and Krumhansl, 1993 ; McAdams et al, 1995 ; Lakatos, 2000 ), affective ratings (Laurier et al, 2009 ; Farbood and Price, 2017 ; McAdams et al, 2017 ), and the perceptual similarity of short music clips (Siedenburg and Müllensiefen, 2017 ). Most often, the spectral features are derived from statistical computations on a spectrogram, whereas the temporal features are usually extracted from the raw waveform.…”
Section: Introductionmentioning
confidence: 99%
“…However, Schellenberg et al (1999) argued, based on the facts that their plinks were too brief to present tempo or relative pitch information and that listeners recognized reversed plinks less accurately than regular plinks (despite them sharing acoustic features related to absolute pitch), that listeners must have utilized spectral timbral cues to identify the plinks. Siedenburg and Mu ¨llensiefen (2017) developed statistical models to determine which acoustic factors enable listeners to categorize unfamiliar plinks. The authors proposed that multiple timbre-related acoustic cues contribute to listeners' abilities to judge plink similarity.…”
mentioning
confidence: 99%