Elevation localization and head-related transfer function analysis at low frequencies

Algazi, V. Ralph; Avendaño, Carlos; Duda, Richard O.

doi:10.1121/1.1349185

Cited by 165 publications

(148 citation statements)

References 24 publications

Supporting

Mentioning

130

Contrasting

Unclassified

Order By: Relevance

“…The auditory spatial cues involved in sound localization include interaural time (and/or phase) and intensity differences (ITDs and IIDs, respectively) in azimuth and pinna/headgenerated spectral cues in elevation that also assist front-back discrimination (Algazi et al 2001;Asano et al 1990;Middlebrooks 1992;Musicant and Butler 1984;Oldfield and Parker 1984b;Rayleigh 1907;Wightman and Kistler 1989b). For pure tones, ITDs are accompanied by interaural phase differences (IPDs) that are reliable only up to ϳ1.5 kHz in humans due to phase ambiguity coupled with declining neuronal temporal coding (phase-locking) at higher frequencies (Dreyer and Delgutte 2006;Johnson 1980;Joris and Yin 1992;Klumpp and Eady 1956;Zwislocki and Feldman 1956).…”

mentioning

confidence: 99%

Influence of aging on human sound localization

Dobreva

O’Neill

Paige

2011

Journal of Neurophysiology

121

View full text Add to dashboard Cite

show abstract

mentioning

confidence: 99%

Influence of aging on human sound localization

Dobreva

O’Neill

Paige

2011

Journal of Neurophysiology

121

View full text Add to dashboard Cite

show abstract

“…On the other hand, the role of low frequencies in vertical localization has also been reported by Morimoto et al [7]. Gardner [8] and Algazi et al [9] showed that torso reflections produce spectral notches in the HRTF below 3 kHz, which are additional localization cues for an elevated source.…”

Section: Introductionmentioning

confidence: 53%

“…0, for a sound source in the median plane, a torso reflection produces a spectral notch in the HRTF below 3 kHz when it is combined with the direct sound at the ear [8,9]. Algazi et al [9] showed that such a notch varies in frequency as a function of torso reflection delay, which is elevation-dependent. The torso reflection delay tends to increase as the elevation angle increases, and reaches its maximum when the source is in the "above" region.…”

Section: A New Hypothesis On the Role Of Acoustic Crosstalk And Torsomentioning

confidence: 99%

“…Long term average spectra (LTAS) of the natural sound sources used for the experiment; the frame length was 4096 samples with a Hanning window and 50% overlap, the FFT point was 4096, and a 1/6-octave Gaussian smoothing was applied to the resulting spectral magnitude. that corresponds to the perceived position is selected (e.g., Morimoto et al [7], Algazi et al [9]). However, from a pilot test it was recognized to be a challenging and timeconsuming task to precisely localize perceived image position in the median plane, especially when the image appeared above or behind the listener.…”

Section: Test Proceduresmentioning

confidence: 99%

See 1 more Smart Citation

Sound Source and Loudspeaker Base Angle Dependency of Phantom Image Elevation Effect

Lee

2017

J. Audio Eng. Soc.

View full text Add to dashboard Cite

Early studies found that, when identical signals were presented from two loudspeakers equidistant from the listener, the resulting phantom image was elevated in the median plane and the degree of the elevation increased with the loudspeaker base angle. However, sound sources used in such studies were either unknown or limited to noise signals. In order to investigate the dependencies of the elevation effect on sound source and loudspeaker base angle in details, the present study conducted listening tests using 11 natural sources and 4 noise sources with different spectral and temporal characteristics for 7 loudspeaker base angles between 0• and 360• . The elevation effect was found to be significantly dependent on the sound source and base angle. Results generally suggest that the effect is stronger for sources with transient nature and a flat frequency spectrum than for continuous and low-frequencydominant sources. Theoretical reasons for the effect are also discussed based on head-related transfer function measurements. It is proposed that the perceived degree of elevation would be determined by a relative cue related to the spectral energy distribution at high frequencies, but by an absolute cue associated with the acoustic crosstalk and torso reflections at low frequencies.

show abstract

“…However humans are still able to localize sound in the median plane by what is known as monaural cues, which are related to the spectral changes introduced by the outer ears (i.e. pinnae) at high frequencies and other body structures like the torso at low frequencies [111]. Some studies have shown that these cues help listeners with complete hearing loss in one ear to localize the azimuth direction of a source with relatively high accuracy.…”

Section: Spectral Cuesmentioning

confidence: 99%

Application of sound source separation methods to advanced spatial audio systems.

Serrano¹

View full text Add to dashboard Cite

This thesis is related to the field of Sound Source Separation (SSS). It addresses the development and evaluation of these techniques for their application in the resynthesis of high-realism sound scenes by means of Wave Field Synthesis (WFS). Because the vast majority of audio recordings are preserved in two-channel stereo format, special up-converters are required to use advanced spatial audio reproduction formats, such as WFS. This is due to the fact that WFS needs the original source signals to be available, in order to accurately synthesize the acoustic field inside an extended listening area. Thus, an object-based mixing is required.Source separation problems in digital signal processing are those in which several signals have been mixed together and the objective is to find out what the original signals were. Therefore, SSS algorithms can be applied to existing two-channel mixtures to extract the different objects that compose the stereo scene. Unfortunately, most stereo mixtures are underdetermined, i.e., there are more sound sources than audio channels. This condition makes the SSS problem especially difficult and stronger assumptions have to be taken, often related to the sparsity of the sources under some signal transformation.This thesis is focused on the application of SSS techniques to the spatial sound reproduction field. As a result, its contributions can be categorized within these two areas. First, two underdetermined SSS methods are proposed to deal efficiently with the separation of stereo sound mixtures. These techniques are based on a multi-level thresholding segmentation approach, which enables to perform a fast and unsupervised separation of sound sources in the time-frequency domain. Although both techniques rely on the same clustering type, the features considered by each of them are related to different localization cues that enable to perform separation of either instantaneous or real mixtures. Additionally, two post-processing techniques aimed at improving the isolation of the separated sources are proposed. The performance achieved by several SSS methods in the resynthesis of WFS sound scenes is afterwards evaluated by means of listening tests, paying special attention to the change observed in the perceived spatial attributes. Although the estimated sources are distorted versions of the original ones, the masking effects involved in their spatial remixing make artifacts less perceptible, which improves the overall assessed quality. Finally, some novel developments related to the application of time-frequency processing to source localization and enhanced sound reproduction are presented.Keywords: Wave Field Synthesis, Sound Source Separation, Time Frequency Processing, Direction of Arrival, Spatial Audio Quality. ResumenEsta tesis se enmarca dentro del campo de la Separación de Fuentes Sonoras (SSS), donde se ha trabajado en el desarrollo y evaluación de estas técnicas para aplicarlas a la resíntesis de escenas sonoras de alto realismo utilizando Síntesis de Campo de Ondas...

show abstract

Elevation localization and head-related transfer function analysis at low frequencies

Cited by 165 publications

References 24 publications

Influence of aging on human sound localization

Influence of aging on human sound localization

Sound Source and Loudspeaker Base Angle Dependency of Phantom Image Elevation Effect

Application of sound source separation methods to advanced spatial audio systems.

Contact Info

Product

Resources

About