When re-synthesizing individual head related transfer functions (HRTFs) with a microphone array, smoothing HRTFs spectrally and/or spatially prior to the computation of appropriate microphone filters may improve the synthesis accuracy. In this study, the limits of the associated HRTF modifications, until which no perceptual degradations occur, are explored. First, complex spectral smoothing of HRTFs into constant relative bandwidths was considered. As a prerequisite to complex smoothing, the HRTF phase spectra were substituted by linear phases, either for the whole frequency range or above a certain cut-off frequency only. The results indicate that a broadband phase linearization of HRTFs can be perceived for certain directions/subjects and that the thresholds can be predicted by a simple model. HRTF phase spectra can be linearized above 1 kHz without being detectable. After substituting the original phase by a linear phase above 5 kHz, HRTFs may be smoothed complexly into constant relative bandwidths of 1/5 octave, without introducing noticeable artifacts. Second, spatially smoother HRTF directivity patterns were obtained by levelling out spatial notches. It turned out that spatial notches do not have to be retained if they are less than 29 dB below the maximum level in the directivity pattern.
In order to take into account spatial information into binaural recordings, it is common practice to use so-called artificial heads. Disadvantageously artificial heads are inherently non-individual and bulky devices. Alternatively, the individual frequency-dependent directivity pattern of human head related transfer functions (HRTFs) can also be approximated by a microphone array with appropriate filters [Rasumow et al. (2011)]. Such a setup may be referred to as a virtual artificial head (vah). The filters for the application of the vah can be derived by minimizing a narrow band cost function including regularization constraints. As a first approach, it is appropriate to apply a least-squares cost function. The major advantage is its closed form solution [cf. Rasumow et al. (2011)], whereas from a psychoacoustically point of view, it seems more reasonable to minimize the dB-error instead. The latter cost function must, however, be minimized iteratively. We propose a minimization procedure for and present first results regarding the subjective appraisal of binaural filters derived using both cost functions. Future work includes the extension of this work to binaural cost functions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.