Smartphone technology provides new opportunities for recording standardized voice samples of patients and sending the files by e-mail to the voice laboratory. This drastically improves the collection of baseline data, as used in research on efficiency of voice treatments. However, the basic requirement is the suitability of smartphones for recording and digitizing pathologic voices (mainly characterized by period perturbations and noise) without significant distortion. In this experiment, two smartphones (a very inexpensive one and a high-level one) were tested and compared with direct microphone recordings in a soundproof room. The voice stimuli consisted in synthesized deviant voice samples (median of fundamental frequency: 120 and 200 Hz) with three levels of jitter and three levels of added noise. All voice samples were analyzed using PRAAT software. The results show high correlations between jitter, shimmer, and noise-to-harmonics ratio measured on the recordings via both smartphones, the microphone, and measured directly on the sound files from the synthesizer. Smartphones thus appear adequate for reliable recording and digitizing of pathologic voices.
This study presents stochastic models of jitter. Jitter designates small, random, involuntary perturbations of the glottal cycle lengths. Jitter is a base-line phenomenon that may be observed in all voiced speech sounds. Knowledge of its properties is therefore relevant to the acoustic modeling, analysis, and synthesis of voice quality. Also, models of jitter are conceptual frameworks that enable experimenters and clinicians to distinguish jitter in particular from aperiodic cycle length patterns in general. Vocal jitter is modeled by means of the ribbon model of the glottal vibration combined with stochastic models of the disturbances of the instantaneous frequency. The disturbance model comprises correlation-free noise and vocal microtremor. Properties of jitter that are simulated are the stochasticity, stationarity, and normality of the decorrelated cycle length perturbations, the size of decorrelated jitter, the correlation between the perturbations of neighboring glottal cycles, the modulation level and modulation frequency owing to microtremor, the asynchrony between external disturbances and glottal cycles, the dependence of the size of jitter on the average glottal cycle length, and the relation between jitter and laryngeal pathologies. Modeled jitter is discussed in the light of measured jitter, as well as the physiological and statistical plausibility of the model parameters.
This article presents an analysis of entrained oscillations of the right and left vocal folds in the presence of asymmetries. A simple one-mass model is proposed for each vocal fold. A stiffness asymmetry and open glottis oscillations are considered first, and regions of oscillation are determined by a stability analysis and an averaging technique. The results show that the subglottal threshold pressure for 1:1 entrainment increases with the asymmetry. Within that region, both folds oscillate with the same amplitude and with the lax fold delayed in time with regard to the tense fold. At large asymmetries, a region involving several different phase entrainments or toroidal regimes at constant threshold pressure appears. The effect of vocal fold collisions and asymmetry in the damping coefficients of the oscillators are explored next by means of numerical analyses. It is shown that the damping asymmetry expands the 1:1 entrainment region at low subglottal pressures across the whole asymmetry range. In the expanded region, the oscillator with the lowest natural frequency is dominant and the other oscillator has a large phase advance and small amplitude. The theoretical results are finally compared with data collected from a mechanical replica of the vocal folds.
Vocal microtremor designates a normal slow modulation of the vocal cycle lengths of speakers who do not suffer from pathological tremor of the limbs and whose voices are not perceived as tremulous. Vocal microtremor is therefore distinct from pathological vocal tremor. The objective is to report data about the modulation frequency and modulation level owing to vocal microtremor. The modulation data have been obtained for vowels [a], [i], and [u] sustained by normophonic and mildly dysphonic male and female speakers. The results are the following. First, modulation frequencies and relative modulation levels do not differ significantly for male and female speakers, normophonic and mildly dysphonic speakers, as well as for vowel timbres [a], [i], and [u]. Second, the typical interquartile intervals of the modulation frequency and modulation level are equal to 2.0-4.7 Hz and 0.4%-1.3%, respectively. Third, dissimilarities between data reported by different studies are due to different cutoff frequencies below which spectral peaks are considered not to contribute to vocal microtremor.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.