Time-domain computer simulation of sound production in airways is a widely used tool, both for research and synthetic speech production technology. Speed of computation is generally the rationale for one-dimensional approaches to sound propagation and radiation. Transmission line and wave-reflection (scattering) algorithms are used to produce formant frequencies and bandwidths for arbitrarily shaped airways. Some benchmark graphs and tables are provided for formant frequencies and bandwidth calculations based on specific mathematical terms in the one-dimensional Navier-Stokes equation. Some rules are provided here for temporal and spatial discretization in terms of desired accuracy and stability of the solution. Kinetic losses, which have been difficult to quantify in frequency-domain simulations, are quantified here on the basis of the measurements of Scherer, Torkaman, Kucinschi, and Afjeh [(2010). J. Acoust. Soc. Am. 128(2), 828-838].
Long-distance vocal communication by birds and mammals, including humans, is facilitated largely by radiation efficiency from the mouth or beak. Here, this efficiency is defined and quantified. It depends on frequency content of vocalization, mouth opening, head and upper body geometry, and directionality. Each of these factors is described mathematically with a piston-in-a-sphere model. While this model is considered a classic, never before has the high frequency solution been applied in detail to vocalization. Results indicate that frequency content in the 1-50 kHz range can be radiated with nearly 100% efficiency if a reactance peak in the radiation impedance is utilized with adjustments of head size, mouth opening, and beam direction. Without these adjustments, radiation efficiency is generally below 1%, especially in human speech where a high fundamental frequency is a disadvantage for intelligibility. Thus, two distinct modes of vocal communication are identified, (1) short range with optimized information transfer and (2) long range with maximum efficiency for release of acoustic power.
The traditional source filter theory of voice production describes a linear relationship between the source (glottal flow pulse) and the filter (vocal tract). Such a linear relationship does not allow for nor explain how changes in the filter may impact the stability and regularity of the source. The objective of this experiment was to examine what effect unpredictable changes to vocal tract dimensions could have on fo stability and individual harmonic intensities in situations in which low frequency harmonics cross formants in a fundamental frequency glide. To determine these effects, eight human subjects (five male, three female) were recorded producing fo glides while their vocal tracts were artificially lengthened by a section of vinyl tubing inserted into the mouth. It was hypothesized that if the source and filter operated as a purely linear system, harmonic intensities would increase and decrease at nearly the same rates as they passed through a formant bandwidth, resulting in a relatively symmetric peak on an intensity-time contour. Additionally, fo stability should not be predictably perturbed by formant/harmonic crossings in a linear system. Acoustic analysis of these recordings however revealed that harmonic intensity peaks were asymmetric in 76% of cases and 85% of fo instabilities aligned with a crossing of one of the first four harmonics with the first three formants. These results provide further evidence that non-linear dynamics in the source-filter relationship can impact fo stability as well as harmonic intensities as harmonics cross through formant bandwidths.
Objectives/Hypothesis To test the hypothesis that subligamental cordectomy produces superior acoustic outcome than subepithelial cordectomy for early (T1-2) glottic cancer that requires complete removal of the superficial lamina propria but does not involve the vocal ligament. Study Design Computer simulation Methods A computational tool for vocal fold surgical planning and simulation (the National Center for Voice and Speech Phonosurgery Optimizer-Simulator) was used to evaluate the acoustic output of alternative vocal fold morphologies. Four morphologies were simulated: normal, subepithelial cordectomy, subligamental cordectomy, and transligamental cordectomy (partial ligament resection). The primary outcome measure was the range of fundamental frequency (F0) and sound pressure level (SPL). A more restricted F0-SPL range was considered less favorable because of reduced acoustic possibilities given the same range of driving subglottic pressure and identical vocal fold posturing. Results Subligamental cordectomy generated solutions covering an F0-SPL range 82% of normal for a rectangular vocal fold. In contrast, transligamental and subepithelial cordectomies produced significantly smaller F0-SPL ranges, 57% and 19% of normal, respectively. Conclusion This study illustrates the use of the Phonosurgery Optimizer-Simulator to test a specific hypothesis regarding the merits of two surgical alternatives. These simulation results provide theoretical support for vocal ligament excision with maximum muscle preservation when superficial lamina propria resection is necessary but the vocal ligament can be spared on oncological grounds. The resection of more tissue may paradoxically allow the eventual recovery of a better speaking voice, assuming glottal width is restored. Application of this conclusion to surgical practice will require confirmatory clinical data.
Voice production is an inefficient process in terms of energy expended versus acoustic energy produced. A traditional efficiency measure, glottal efficiency, relates acoustic power radiated from the mouth to aerodynamic power produced in the trachea. This efficiency ranges between 0.0001 % and 1.0 %. It involves lung pressure, and hence would appear to be a useful effort measure for a given acoustic output. Difficulty in the combined measurement of lung pressure and tracheal airflow, however, has impeded clinical application of glottal efficiency. This paper utilizes the large data base from Schutte (1980) and a few new measurements to validate a pressure conversion ratio (PCR) as a substitute for glottal efficiency. PCR has the potential for wide application due to low cost and ease of use in clinics and vocal studios.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.