The lack of a common clock reference is a fundamental problem when dealing with audio streams originating from or heading to different distributed sound capture or playback devices. When implementing multichannel signal processing algorithms for such kind of audio streams it is necessary to account for the unavoidable mismatches between the actual sampling rates. There are some approaches that can help to correct these mismatches, but important problems remain to be solved, among them the accurate estimation of the mismatch factors, and achieving both accuracy and computational efficiency in their correction. In this paper we present an empirical study on the performance of blind source separation and acoustic echo cancellation algorithms in this scenario. We also analyze the degradation in performance when using an approximate but efficient method to correct the rate mismatches.
One of the most important problems in frequency domain blind source separation (FDBSS) is the inconsistency across frequency in the permutation of the source estimates. According to previous studies, this problem can be reduced significantly by constraining the length of the unmixing filters. This improvement has been attributed to the smoothening of the unmixing frequency response. In this paper we study the effect of modifying these length constraints taking into account the circularity of the IDFT, and we show that the smoothening of the unmixing frequency response alone can not account for the improvements in performance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.