An acoustic simulator based on the finite-difference time-domain (FDTD) method was evaluated by acoustic measurements on solid models of the vocal tract. Three-dimensional vocal tract (3D VT) shapes for a male subject during production of the five Japanese vowels were measured by magnetic resonance imaging. Transfer functions of the 3D VT shapes were computed by the acoustic simulator. The accuracy of the finite-difference algorithm was second-order in time and fourth-order in space. From the same 3D VT shapes, solid models were made of epoxide resin by a stereo-lithographic technique, and their acoustic transfer functions were measured using a time-stretched pulse signal. The calculated and measured spectra were compared up to 8 kHz. Although locations of major poles and zeros were common between calculated and measured spectra, the calculated spectra showed a larger number of small zeros, possibly caused by underestimation of VT acoustic losses in the simulation. However, the lower four formants were simulated remarkably accurately, with a mean absolute error of only 2.2 % compared with the acoustic measurements. These facts indicated that the acoustic simulator can reasonably account for acoustic phenomena within the VT.
The acoustic effects of the laryngeal cavity on the vocal tract resonance were investigated by using vocal tract area functions for the five Japanese vowels obtained from an adult male speaker. Transfer functions were examined with the laryngeal cavity eliminated from the whole vocal tract, volume velocity distribution patterns were calculated, and susceptance matching analysis was performed between the laryngeal cavity and the vocal tract excluding the laryngeal cavity (vocal tract proper). It was revealed that the laryngeal cavity generates one of the formants of the vocal tract, which is the fourth in the present study. At this formant, the resonance of the laryngeal cavity (the 1/4 wavelength resonance) induces the open-tube resonance of the vocal tract proper (the 3/2 wavelength resonance). At the other formants, on the other hand, the vocal tract proper acts as a closed tube, because the laryngeal cavity has only a small contribution to generating these formants and the effective closed end of the whole vocal tract is the junction between the laryngeal cavity and the vocal tract proper.
It has been suggested that the first spectral peak and the first two spectral notches of head-related transfer functions (HRTFs) are cues for sound localization in the median plane. Therefore, to examine the mechanism for generating spectral peaks and notches, HRTFs were calculated from four head shapes using the finite-difference time-domain method. The comparison between HRTFs calculated from the whole head and the pinna-related transfer functions calculated from the segmented pinna indicated that the pinna determines the basic peak-notch pattern of the HRTFs. An analysis of the distribution patterns of pressure nodes and anti-nodes on the pinna computed in the steady state for sinusoidal excitations confirmed that the first three peaks correspond to the first three normal modes of the pinna. The analysis also revealed that at the first spectral notch frequencies, one or two anti-nodes appeared in the cymba and the triangular fossa, and a node developed in the concha. Furthermore, according to changes in the instantaneous pressure distribution patterns on the pinna, three types of mechanisms were hypothesized for inducing the node in the concha depending on the source elevation angle.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.