“…The natural cost in this context that leads to ICA is the mutual information among separated components, which can be shown to be equivalent to maximum likelihood estimation, and to negentropy maximization [40,59,61,64] when we constrain the demixing matrix to be orthogonal. In these approaches, one either estimates a parametric density model [61,63,65,85] along with the demixing matrix, or maximizes the information transferred in a network of non-linear units [58,67], or estimates the entropy using a parametric or nonparametric approach [58,63,68,69]. A recent semi-parametric approach uses the maximum entropy bound to estimate the entropy given the observations, and uses a numerical procedure thus resulting in accurate estimates for the entropy [42].…”