Makoto OTANI†a) and Tatsuya HIRAHARA † †b) , Members SUMMARY A non-audible murmur (NAM), a very weak whisper sound produced without vocal fold vibration, has been researched in the development of a silent-speech communication tool for functional speech disorders as well as human-to-human/machine interfaces with inaudible voice input. The NAM can be detected using a specially designed microphone, called a NAM microphone, attached to the neck. However, the detected NAM signal has a low signal-to-noise ratio and severely suppressed highfrequency component. To improve NAM clarity, the mechanism of a NAM production must be clarified. In this work, an air flow through a glottis in the vocal tract was numerically simulated using computational fluid dynamics and vocal tract shape models that are obtained by a magnetic resonance imaging (MRI) scan for whispered voice production with various strengths, i.e. strong, weak, and very weak. For a very weak whispering during the MRI scan, subjects were trained, just before the scanning, to produce the very weak whispered voice, or the NAM. The numerical results show that a weak vorticity flow occurs in the supraglottal region even during a very weak whisper production; such vorticity flow provide aeroacoustic sources for a very weak whispering, i.e. NAM, as in an ordinary whispering. key words: non-audible murmur, whispered voice, whispered voice production mechanism, computational fluid dynamics, air flow through glottis
IntroductionA non-audible murmur (NAM) is a very weak vocal-tract resonance sound produced without a vocal fold vibration. The weak sound propagates through the body and it can be detected by using a sensor attached to the neck behind the ear, resulting in a NAM signal typically represented by voltage value. The NAM is inaudible, but, after applying appropriate signal amplification and processing to the detected NAM signal, it can be presented to a listener as an audible speech through an electroacoustic transducer, or to a speech recognition system, thus enabling silent-speech interfaces [1]- [3].The speech sound we typically use is an air-borne sound, which is a small and fast vibration of the air. The vibration of the air column in the vocal-tract vibrates the wall of the vocal tract, and some sound energies pass through the tissues of the neck or chest. This body-conducted sound that travels through the neck tissue can be sensed with a custom sensor modified from a microphone. Actually, speech Manuscript received February 24, 2011. Manuscript revised May 10, 2011. † The author is with the Faculty of Engineering, Shinshu University, Nagano-shi, 380-8553 Japan.† † The author is with the Faculty of Engineering, Toyama Prefectural University, Imizu-shi, 939-0398 Japan.a) E-mail: otani@cs.shinshu-u.ac.jp b) E-mail: hirahara@pu-toyama.ac. . The NAM microphone is a small condenser microphone covered with a soft impression material such as a soft silicon or a urethane elastomer, which provides a better impedance matching between biological soft tissues and the co...