Abstract:This paper describes a novel approach for localization of multiple sources overlapping in time. The proposed algorithm relies on acoustic maps computed in multi-microphone settings, which are descriptions of the distribution of the acoustic activity in a monitored area. Through a proper processing of the acoustic maps, the positions of two or more simultaneously active acoustic sources can be estimated in a robust way. Experimental results obtained on real data collected for this specific task show the capabil… Show more
“…In addition, variations on the GCC-PHAT have been employed, such as in [30] where the authors were able to estimate jointly the DOA and pitch of two moving sources using a linear array of six microphones in reverberative simulated scenarios. Another variation was presented in [31], where the authors build an acoustic map of a room using a 13-microphone linear array, based on the GCC-PHAT technique, to directly estimate the ITD of multiple users. A detailed description of the GCC-PHAT methodology is provided in Section 4.3.1.…”
Section: Background On Multi-doa Estimationmentioning
Estimating the directions of arrival (DOAs) of multiple simultaneous mobile sound sources is an important step for various audio signal processing applications. In this contribution, we present an approach that improves upon our previous work that is now able to estimate the DOAs of multiple mobile speech sources, while being light in resources, both hardware-wise (only using three microphones) and software-wise. This approach takes advantage of the fact that simultaneous speech sources do not completely overlap each other. To evaluate the performance of this approach, a multi-DOA estimation evaluation system was developed based on a corpus collected from different acoustic scenarios named Acoustic Interactions for Robot Audition (AIRA).
“…In addition, variations on the GCC-PHAT have been employed, such as in [30] where the authors were able to estimate jointly the DOA and pitch of two moving sources using a linear array of six microphones in reverberative simulated scenarios. Another variation was presented in [31], where the authors build an acoustic map of a room using a 13-microphone linear array, based on the GCC-PHAT technique, to directly estimate the ITD of multiple users. A detailed description of the GCC-PHAT methodology is provided in Section 4.3.1.…”
Section: Background On Multi-doa Estimationmentioning
Estimating the directions of arrival (DOAs) of multiple simultaneous mobile sound sources is an important step for various audio signal processing applications. In this contribution, we present an approach that improves upon our previous work that is now able to estimate the DOAs of multiple mobile speech sources, while being light in resources, both hardware-wise (only using three microphones) and software-wise. This approach takes advantage of the fact that simultaneous speech sources do not completely overlap each other. To evaluate the performance of this approach, a multi-DOA estimation evaluation system was developed based on a corpus collected from different acoustic scenarios named Acoustic Interactions for Robot Audition (AIRA).
“…However, as we will show in Section VI, the considered models defined by (5), (14) and (18) return inaccurate estimates if the sources are not located on their postulated GPs. This motivated us to tackle this problem for the case of multi-source TDOA localization.…”
Section: Tackling Grid Mismatch For Off-grid Sourcesmentioning
confidence: 99%
“…First of all, they should be nonlinear in general since linear functions generate dependent rows in which in principle does not increase the number of independent equations in (14). Moreover, these functions should not impair the restricted isometry property (RIP) [23] of required for a high quality reconstruction.…”
“…Surprisingly, this corresponds to orthonormalizing the rows of (see also Appendix A), which has indeed been shown to improve the RIP [7]. Having said that, by employing the operator , (14) should be modified to (18) Finally, (18) can be solved by (19) where is defined as earlier.…”
Section: Remark 3 (Identifiability Of Esmtl)mentioning
confidence: 99%
“…To obtain such a map, distance information (between source(s) and receivers) is required which becomes computationally demanding for a near-field assumption. In [14], a fingerprinting-like approach is proposed and the area is discretized into a set of GPs for which an acoustic map function is defined. Through a proper processing of the acoustic map and de-emphasizing the effect of the dominant source, they illustrate a good performance in localizing two sources, but in some situations their performance drops if the number of targets is larger than three.…”
Abstract-The problem of source localization from time-difference-of-arrival (TDOA) measurements is in general a non-convex and complex problem due to its hyperbolic nature. This problem becomes even more complicated for the case of multi-source localization where TDOAs should be assigned to their respective sources. We simplify this problem to an -norm minimization by introducing a novel TDOA fingerprinting and grid design model for a multi-source scenario. Moreover, we propose an innovative trick to enhance the performance of our proposed fingerprinting model in terms of the number of identifiable sources. An interesting by-product of this enhanced model is that under some conditions we can convert the given underdetermined problem to an overdetermined one that could be solved using classical least squares (LS). Finally, we also tackle the problem of off-grid source localization as a case of grid mismatch. Our extensive simulation results illustrate a good performance for the introduced TDOA fingerprinting paradigm as well as a significant detection gain for the enhanced model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.