“…Within the framework of independent component analysis (ICA) [17], BSS problems have been studied extensively, and many classical algorithms have been proposed for the instantaneous mixing model, such as the ‘J-H’ algorithm [18], JADE [19], Infomax [20], SOBI [21] and FastICA [22]. For the more complex convolutive mixing model, one can apply either time-domain deconvolution algorithms [23][24][25] or frequency-domain separation algorithms [12][13][14][15][26][27][28][29][30][31]; the latter often suffer from permutation and scaling ambiguities. Considering the bimodal nature of human speech, the separation of source signals from their audio mixtures could potentially be improved by exploiting the audiovisual coherence obtained by integrating visual speech.…”
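
The two mixing models and the ambiguities mentioned in the passage can be written out explicitly. The following is only a sketch in commonly used BSS notation and is not taken from the cited works. All symbols are assumptions introduced here for illustration: $\mathbf{s}$ (sources), $\mathbf{x}$ (observed mixtures), $\mathbf{A}$ (mixing matrix or filters), $\mathbf{W}$ (separation matrix), $\mathbf{P}$ (permutation matrix), $\mathbf{D}$ (diagonal scaling matrix), $L$ (mixing filter length), $f$ (frequency bin) and $\tau$ (time frame).

```latex
\begin{align}
  % Instantaneous mixing: each sensor observes a weighted sum of the sources.
  \mathbf{x}(t) &= \mathbf{A}\,\mathbf{s}(t) \\
  % Convolutive mixing: each sensor observes filtered, delayed copies of the sources.
  \mathbf{x}(t) &= \sum_{l=0}^{L-1} \mathbf{A}(l)\,\mathbf{s}(t-l) \\
  % Short-time Fourier domain (assuming the analysis window is long relative to L):
  % the convolution becomes approximately one instantaneous mixture per frequency bin.
  \mathbf{X}(f,\tau) &\approx \mathbf{A}(f)\,\mathbf{S}(f,\tau) \\
  % Separating each bin independently recovers the sources only up to
  % a bin-dependent permutation P(f) and scaling D(f).
  \mathbf{Y}(f,\tau) &= \mathbf{W}(f)\,\mathbf{X}(f,\tau)
      \approx \mathbf{P}(f)\,\mathbf{D}(f)\,\mathbf{S}(f,\tau)
\end{align}
```

Because $\mathbf{P}(f)$ and $\mathbf{D}(f)$ may differ from one frequency bin to the next, the separated components have to be aligned and rescaled across bins before the time-domain sources can be reconstructed.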