Interspeech 2009 2009
DOI: 10.21437/interspeech.2009-330
|View full text |Cite
|
Sign up to set email alerts
|

Singing voice detection in polyphonic music using predominant pitch

Abstract: This paper demonstrates the superiority of energy-based features derived from the knowledge of predominant-pitch, for singing voice detection in polyphonic music over commonly used spectral features. However, such energy-based features tend to misclassify loud, pitched instruments. To provide robustness to such accompaniment we exploit the relative instability of the pitch contour of the singing voice by attenuating harmonic spectral content belonging to stable-pitch instruments, using sinusoidal modeling. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…To increase robustness to such accompaniment we then enhanced the feature by taking into account the temporal variation of the pitch contour, in order to exploit voice-pitch instability (see Section 2), as opposed to keyed-instrument pitch stability. Classification results using this enhanced feature did not show any degradation even in the presence of loud pitched instruments [23].…”
Section: Sung Segment Detectionmentioning
confidence: 87%
“…To increase robustness to such accompaniment we then enhanced the feature by taking into account the temporal variation of the pitch contour, in order to exploit voice-pitch instability (see Section 2), as opposed to keyed-instrument pitch stability. Classification results using this enhanced feature did not show any degradation even in the presence of loud pitched instruments [23].…”
Section: Sung Segment Detectionmentioning
confidence: 87%
“…People's life needs are the starting point of scientific research, and science and technology are the driving force that promotes the continuous progress of mankind (Powers 2011). The literature shows that as people pay more and more attention to the mobile Internet of Things, smartphone operating systems and audio and video communication standards have emerged one after another (Rao et al 2008).…”
Section: Related Workmentioning
confidence: 99%
“…Such distinction mistakes lead to plenty of Instrumental being labelled as Song. Aerophones and fretless stringed instruments, for example, are known to produce similar pitch modulations as the human voice [53,54]. This study focuses on improving Instrumental detection in musical databases because the current state-of-the-art algorithms are unable to generate a faultless playlist with the tag Instrumental [55,56].…”
Section: Introductionmentioning
confidence: 99%