2019
DOI: 10.1109/taslp.2018.2869684
|View full text |Cite
|
Sign up to set email alerts
|

Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation

Abstract: This paper introduces a phase-aware probabilistic model for audio source separation. Classical source models in the short-time Fourier transform domain use circularly-symmetric Gaussian or Poisson random variables. This is equivalent to assuming that the phase of each source is uniformly distributed, which is not suitable for exploiting the underlying structure of the phase. Drawing on preliminary works, we introduce here a Bayesian anisotropic Gaussian source model in which the phase is no longer uniform. Suc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(18 citation statements)
references
References 55 publications
(130 reference statements)
0
18
0
Order By: Relevance
“…In particular, the standard EM algorithm using a reduced set of latent variables provides faster convergence and better separation results than the SAGE algorithm used in the literature [8]. It also compares favorably with the conventional multiplicative algorithm [3], which confirms its potential for estimating more sophisticated NMF models [15].…”
Section: Introductionmentioning
confidence: 70%
See 1 more Smart Citation
“…In particular, the standard EM algorithm using a reduced set of latent variables provides faster convergence and better separation results than the SAGE algorithm used in the literature [8]. It also compares favorably with the conventional multiplicative algorithm [3], which confirms its potential for estimating more sophisticated NMF models [15].…”
Section: Introductionmentioning
confidence: 70%
“…This is particularly interesting since in more sophisticated models where the likelihood of the data is not tractable, one cannot apply ML-MUR, and EM-MUR would then fully reveal its potential. For instance, it can be useful for estimating anisotropic Gaussian models with NMF variance [15], or in a multichannel ISNMF framework, where it is common to exploit EM algorithms [16,21].…”
Section: Resultsmentioning
confidence: 99%
“…However, this distortion metric is not well adapted to audio [7]. To alleviate this issue, complex NMF has recently been extended to KL [11] and IS [12] divergences.…”
Section: Complex Nmfmentioning
confidence: 99%
“…However, it has been extended to the KL divergence by using a primal-dual formulation of the optimization problem [11]. It has also recently been extended to the IS divergence [12] by considering a probabilistic framework based on anisotropic Gaussian (AG) distributions [13]. This family of distributions permits us to model the sources with non-uniform phase parameters and to structure the variance parameters by means of an NMF model.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation