Besides reducing undesired noise sources and limiting speech distortion, another important objective of a binaural noise reduction algorithm is the preservation of the binaural cues of all sound sources in the acoustic scene. In this paper, we consider the binaural minimum variance distortionless response beamformer with partial noise estimation (BMVDR-N), which allows to trade off between noise reduction performance and binaural cue preservation of the noise component by mixing the output signals of the BMVDR beamformer with the noisy reference microphone signals.For a directional noise source, it has been shown that incorporating an external microphone in addition to the head-mounted microphones enables both the noise reduction performance as well as the interaural time and level difference cues of the noise component to be improved in the output signals. In this paper, we consider an arbitrary noise field and analytically show that incorporating an external microphone in the BMVDR-N beamformer enables 1) a larger output signal-to-noise ratio (SNR) for the same mixing parameter, 2) the same output SNR for a larger mixing parameter, and 3) the same desired output magnitude squared coherence (MSC) of the noise component for a smaller mixing parameter to be obtained. The derived analytical expressions are firstly validated using simulated anechoic acoustic transfer functions, where the listener's head is modelled as a rigid sphere. Experimental results using recorded signals for a binaural hearing device setup in a reverberant environment also show that in a realistic scenario incorporating an external microphone in the BMVDR-N beamformer significantly improves the output SNR and reduces the mixing parameter that is required to obtain a desired output MSC of the noise component compared to using only the head-mounted microphones.
The binaural minimum-variance distortionless-response (BMVDR) beamformer is a well-known noise reduction algorithm that can be steered using the relative transfer function (RTF) vector of the desired speech source. Exploiting the availability of an external microphone that is spatially separated from the head-mounted microphones, an efficient method has been recently proposed to estimate the RTF vector in a diffuse noise field. When multiple external microphones are available, different RTF vector estimates can be obtained by using this method for each external microphone. In this paper, we propose several procedures to combine these RTF vector estimates, either by selecting the estimate corresponding to the highest input SNR, by averaging the estimates or by combining the estimates in order to maximize the output SNR of the BMVDR beamformer. Experimental results for a moving speaker and diffuse noise in a reverberant environment show that the output SNR-maximizing combination yields the largest binaural SNR improvement and also outperforms the state-of-the art covariance whitening method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.