Tracking of Multiple Fundamental Frequencies in Diplophonic Voices

Aichinger, Philipp; Hagmüller, Martín; Schneider‐Stickler, Berit; Schoentgen, Jean; Pernkopf, Franz

doi:10.1109/taslp.2017.2761233

Cited by 10 publications

(12 citation statements)

References 49 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…5 . The method is adapted from the one described in [ 14 ]. A 32 ms Hann window with a 16 ms overlap is used for blocking signals.…”

Section: Methodsmentioning

confidence: 99%

“…A 32 ms Hann window with a 16 ms overlap is used for blocking signals. Candidate f o -tracks foγ are obtained by picking peaks in the spectrum of the GAW d ′( n ), and applying the Viterbi algorithm six times, as in the “fast” setup described in [14]. The candidate index γ = 1, 2, …, Γ , and Γ is the number of candidates.…”

Section: Methodsmentioning

confidence: 99%

“…The candidate index γ = 1, 2, …, Γ , and Γ is the number of candidates. No high-pass filtering is used, as was for the analysis of audio signals in [14]. Candidate cyclic unit pulse trains u1γ(n) are created for each foγ. Candidate cyclic pulse shapes r γ ( l ) are obtained by cross-correlating candidate u1γ(n) with the observed GAW d ′( n ).…”

Section: Methodsmentioning

confidence: 99%

“…Candidate cyclic unit pulse trains u1γ(n) are created for each foγ. Candidate cyclic pulse shapes r γ ( l ) are obtained by cross-correlating candidate u1γ(n) with the observed GAW d ′( n ). The candidate f o -tracks foγ and the pulse shapes’ discrete Fourier coefficients a γ and b γ are used in a Fourier synthesizer, which determines candidate cyclic pulse trains d1γ(n). For further details the interested reader is referred to [14].…”

Section: Methodsmentioning

confidence: 99%

“…We propose “ultra fast” candidate selection that replaces the candidate selection approach described in [14]. The estimate of the cyclic pulse train d 1 ( n ) is given by trued^1(n)=∑γ=1Γsγ⋅d1γ(n), where the binary candidate selection vector S = s γ ∈ {0, 1}, and Γ is the number of candidates.…”

Section: Methodsmentioning

confidence: 99%

See 4 more Smart Citations

Detection of extra pulses in synthesized glottal area waveforms of dysphonic voices

Aichinger

Pernkopf

Schoentgen

2019

Biomedical Signal Processing and Control

Self Cite

View full text Add to dashboard Cite

Background and objectives The description of production kinematics of dysphonic voices plays an important role in the clinical care of voice disorders. However, high-speed videolaryngoscopy is not routinely used in clinical practice, partly because there is a lack of diagnostic markers that may be obtained from high-speed videos automatically. Aim of the study is to propose and test a procedure that automatically detects extra pulses, which may occur in voiced source signals of pathological voices in addition to cyclic pulses. Material and methods Glottal area waveforms (GAW) are synthesized and used to test a detector for extra pulses. Regarding synthesis, for each GAW a cyclic pulse train is mixed with an extra pulse train, and additive noise. The cyclic pulse trains are varied across GAWs in terms of fundamental frequency, pulse shape, and modulation noise, i.e., jitter and shimmer. The extra pulse trains are varied across GAWs in terms of the height of the extra pulses, and their rates of occurrence. The energy level of the additive noise is also varied. Regarding detection, first, the fundamental frequency is estimated jointly with the cyclic pulse train waveform, second, the modulation noise is estimated, and finally the extra pulse train waveform is estimated. Two versions of the detector are compared, i.e., one that parameterizes the shapes of the cyclic pulses, and one that uses unparameterized pulse shape estimates. Two corpora are used for testing, i.e., one with 100 GAWs containing random extra pulses, and one with 25 GAWs containing extra pulses in the closed phases of each glottal phase representing subharmonic voices. Results and discussion With pulse shape parameterization (PSP) a maximum mean accuracy of 88.3% is achieved when detecting random extra pulses. Without PSP, the maximum mean accuracy reduces to 82.9%. Detection performance decreases if the energy level of additive noise is higher than −25 dB with respect to the energy of the cyclic pulse train, and if the irregularity strength exceeds 0.1. For bicyclic, i.e., subharmonic voices, the approach fails without PSP, whereas with PSP, a mean sensitivity of 87.4% is achieved for subharmonic voices. Conclusion A synthesizer for GAWs containing extra pulses, and a detector for extra pulses are proposed. With PSP, favorable detector performance is observed for not too high levels of additive noise and irregularity strengths. In signals with high noise levels, the detector without PSP outperforms the other one. Detection of extra pulses fails if irregularity strength is large. For subharmonic voices PSP must be used.

show abstract

“…5 . The method is adapted from the one described in [ 14 ]. A 32 ms Hann window with a 16 ms overlap is used for blocking signals.…”

Section: Methodsmentioning

confidence: 99%