2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5946423
|View full text |Cite
|
Sign up to set email alerts
|

A trend estimation algorithm for singing pitch detection in musical recordings

Abstract: Detecting pitch values for singing voice in the presence of music accompaniment is challenging but useful for many applications. We propose a trend estimation algorithm to detect the pitch ranges of a singing voice in each time frame. The detected trend substantially reduces the difficulty of singing pitch detection by reducing a large number of wrong pitch candidates either produced by musical instruments or the overtones of the singing voice. The proposed algorithm can be applied to improve the performance o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 6 publications
0
10
0
Order By: Relevance
“…Note that represents the Euclidean distance between the observed partial-amplitude vector and the th example partial-amplitude vector, which is converted to a similarity score by (the monotonically decreasing portion of) the Gaussian function in (5). A high value of would always imply that is the true vocal F0 because its partials exhibit vocal timbre.…”
Section: A Timbral Fitness Functionmentioning
confidence: 99%
See 4 more Smart Citations
“…Note that represents the Euclidean distance between the observed partial-amplitude vector and the th example partial-amplitude vector, which is converted to a similarity score by (the monotonically decreasing portion of) the Gaussian function in (5). A high value of would always imply that is the true vocal F0 because its partials exhibit vocal timbre.…”
Section: A Timbral Fitness Functionmentioning
confidence: 99%
“…Dressler [4] devised an approach where the predominant melody is tracked by an auditory streaming model that favors unstable, high-magnitude F0 contours. Hsu et al [5] extracted the relative extents of vibrato and tremolo from each partial as features for classifying vocal and instrumental partials, and implemented F0 continuity by determining a sequence of tight ranges for the vocal F0. Tachibana et al [6] used instability in F0 and intensity as well as shortness in duration to enhance melodic components.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations