2001
DOI: 10.1109/89.952489
|View full text |Cite
|
Sign up to set email alerts
|

Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech

Abstract: Abstract-Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker's vocal tract. In this paper, we propose a method-the pitch-scaled harmonic filter (PSHF)-which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
36
0
1

Year Published

2003
2003
2017
2017

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 69 publications
(38 citation statements)
references
References 48 publications
1
36
0
1
Order By: Relevance
“…Harmonics-to-noise ratio (HNR): A spectral measure of harmonics-to-noise ratio was performed using a periodic/noise decomposition method that employs a comb filter to extract the harmonic component of a signal [17][18][19]. This "pitch-scaled harmonic filter" approach uses an analysis window duration equal to an integer number of local periods (four in the current work) and relies on the property that harmonics of the fundamental frequency exist at specific frequency bins of the short-time discrete Fourier transform (DFT).…”
Section: Voice Source Propertiesmentioning
confidence: 99%
“…Harmonics-to-noise ratio (HNR): A spectral measure of harmonics-to-noise ratio was performed using a periodic/noise decomposition method that employs a comb filter to extract the harmonic component of a signal [17][18][19]. This "pitch-scaled harmonic filter" approach uses an analysis window duration equal to an integer number of local periods (four in the current work) and relies on the property that harmonics of the fundamental frequency exist at specific frequency bins of the short-time discrete Fourier transform (DFT).…”
Section: Voice Source Propertiesmentioning
confidence: 99%
“…It discriminates words in tonal languages, allows expressing emotions, discriminates questions from statements, and allows emphasizing parts of an utterance. Furthermore, pitch tracking is the basis for the separation of harmonic speech from other speech components and background noise [1].…”
Section: Introductionmentioning
confidence: 99%
“…If the signal is judged unvoiced, then indirect measures such as zero crossing rate and the ratio of low-to high-frequency energy are used to determine if the signal contains noise. In [3], estimates of simultaneous voiced and turbulence-noise components in the speech signal are obtained, but the performance of the system relies on accurate estimates of the pitch period. However, pitch estimation is a difficult task that is prone to errors (pitch doubling and pitch halving).…”
Section: Introductionmentioning
confidence: 99%