Phase Retrieval With Bregman Divergences and Application to Audio Signal Recovery

Vial, Pierre-Hugo; Magron, Paul; Oberlin, Thomas; Févotte, Cédric

doi:10.1109/jstsp.2021.3051870

Cited by 20 publications (25 citation statements)

References 73 publications

“…where $A^\dagger$ is the Moore-Penrose pseudo-inverse of $A$ defined as $A^\dagger = (A^{\mathsf{H}} A)^{-1} A^{\mathsf{H}}$, which encodes the inverse STFT. This iterative scheme, known as the Griffin-Lim (GL) algorithm, is proved to converge to a critical point of the quadratic loss in (1) [19], and can also be obtained by majorization-minimization [20] or using a gradient descent scheme [17]. Improvements of this algorithm notably include accelerated [21] and real-time purposed versions [22].…”

Section: Phase Recovery (mentioning)

confidence: 99%
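
The iteration described in this statement can be illustrated with a minimal sketch. It assumes a generic tall complex matrix `A` standing in for the STFT operator and a complex unknown `x`; the function name `griffin_lim`, the random test problem, and the iteration count are illustrative choices, not taken from the cited work. Each step imposes the target magnitudes `r` on `A x` and maps back through the pseudo-inverse.

```python
import numpy as np

def griffin_lim(A, r, n_iter=100, seed=0):
    """Toy Griffin-Lim iteration: x <- pinv(A) @ (r * phase(A @ x)).

    A is a hypothetical stand-in for the STFT matrix, r the target magnitudes.
    Returns the final iterate and the quadratic loss recorded before each update.
    """
    rng = np.random.default_rng(seed)
    # For full column rank A, pinv(A) equals (A^H A)^{-1} A^H.
    A_pinv = np.linalg.pinv(A)
    n = A.shape[1]
    x = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    losses = []
    for _ in range(n_iter):
        c = A @ x
        # Quadratic loss between current and target magnitudes.
        losses.append(np.sum((np.abs(c) - r) ** 2))
        # Keep the current phase, replace the magnitude by the target r.
        phase = c / np.maximum(np.abs(c), 1e-12)
        x = A_pinv @ (r * phase)
    return x, np.array(losses)

# Small synthetic problem (assumed sizes, for illustration only).
rng = np.random.default_rng(1)
A = rng.standard_normal((64, 16)) + 1j * rng.standard_normal((64, 16))
x_true = rng.standard_normal(16) + 1j * rng.standard_normal(16)
r = np.abs(A @ x_true)

x_hat, losses = griffin_lim(A, r)
print(f"loss: first = {losses[0]:.3e}, last = {losses[-1]:.3e}")
```

Because each step is a projection onto the magnitude constraint followed by a projection onto the range of `A`, the recorded loss should not increase from one iteration to the next, although the iteration may stall at a critical point rather than reach zero.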

“…In [17], we proposed to replace the quadratic loss in problem (1) with Bregman divergences, which encompass the β-divergence [15] and its special cases, the KL and IS divergences. A Bregman divergence $D_\psi$ is defined from a strictly-convex, continuously-differentiable generating function $\psi$ (with derivative $\psi'$) as follows:…”

Section: Phase Recovery With the Bregman Divergence (mentioning)

confidence: 99%
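
The statement above is cut off before the definition. As a reminder of the standard scalar form, D_ψ(y | x) = ψ(y) − ψ(x) − ψ′(x)(y − x), summed over entries, the sketch below evaluates it for three common generating functions. The dictionary `GENERATORS`, the function name `bregman`, and the test vectors are illustrative assumptions; the notation and argument order used in the citing paper may differ.

```python
import numpy as np

# Generating function psi and its derivative for three common cases.
GENERATORS = {
    "quadratic": (lambda z: 0.5 * z ** 2, lambda z: z),
    "kl": (lambda z: z * np.log(z) - z, lambda z: np.log(z)),
    "is": (lambda z: -np.log(z), lambda z: -1.0 / z),
}

def bregman(y, x, name):
    """Entrywise Bregman divergence D_psi(y | x), summed over all entries."""
    psi, dpsi = GENERATORS[name]
    return np.sum(psi(y) - psi(x) - dpsi(x) * (y - x))

# Strictly positive test data (required by the KL and IS generators).
y = np.array([1.0, 2.0, 3.0])
x = np.array([1.5, 1.0, 4.0])

for name in GENERATORS:
    print(name, bregman(y, x, name))
```

With these generators the general formula reduces to half the squared Euclidean distance, the generalized Kullback-Leibler divergence, and the Itakura-Saito divergence, respectively.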

“…Typical Bregman divergences with their generating function and derivative can be found, e.g., in [17] (see Table 1).…”

Section: Phase Recovery With the Bregman Divergence (mentioning)

confidence: 99%

“…In [17] we derived two algorithms for solving both problems, based on gradient descent and alternating direction method of multipliers (ADMM).…”

Section: Phase Recovery With the Bregman Divergence (mentioning)

confidence: 99%
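
To give a rough idea of what a gradient-descent approach to such a problem can look like, here is a minimal sketch. It is not the algorithm from [17]: it assumes one particular loss, the KL divergence D(r | |Ax|) between target magnitudes `r` and the magnitudes of `A x`, a generic complex matrix `A` in place of the STFT, and a simple step-halving rule. The names `kl_loss_and_grad` and `gradient_descent` are hypothetical.

```python
import numpy as np

def kl_loss_and_grad(x, A, r):
    """KL loss D(r | |Ax|) and its gradient with respect to complex x.

    The gradient g satisfies dJ = Re(g^H dx), so -g is a descent direction.
    """
    c = A @ x
    u = np.abs(c)
    loss = np.sum(r * np.log(r / u) - r + u)
    # d/du of D(r | u) is psi''(u) * (u - r), with psi''(u) = 1/u for KL;
    # the chain rule through u = |c| contributes the phase factor c / u.
    w = (u - r) / u * (c / u)
    return loss, A.conj().T @ w

def gradient_descent(A, r, x0, n_iter=50, step=1.0):
    """Gradient descent where each step is halved until the loss decreases."""
    x = x0.copy()
    loss, g = kl_loss_and_grad(x, A, r)
    losses = [loss]
    for _ in range(n_iter):
        t = step
        for _halving in range(40):
            new_loss, new_g = kl_loss_and_grad(x - t * g, A, r)
            if new_loss < loss:
                break
            t *= 0.5
        else:
            break  # no decreasing step found; stop early
        x, loss, g = x - t * g, new_loss, new_g
        losses.append(loss)
    return x, np.array(losses)

# Small synthetic problem (assumed sizes, for illustration only).
rng = np.random.default_rng(0)
A = rng.standard_normal((32, 8)) + 1j * rng.standard_normal((32, 8))
x_true = rng.standard_normal(8) + 1j * rng.standard_normal(8)
r = np.abs(A @ x_true)
x0 = rng.standard_normal(8) + 1j * rng.standard_normal(8)

x_hat, losses = gradient_descent(A, r, x0)
print(f"loss: first = {losses[0]:.3e}, last = {losses[-1]:.3e}")
```

The step-halving rule only guarantees that the loss decreases at every accepted step; it says nothing about reaching the global minimum, since the problem is non-convex.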

“…These divergences are acknowledged for their superior performance in audio spectral decomposition applications such as NMF-based source separation [16]. In a previous work [17], we addressed phase recovery with the Bregman divergences in a single-source setting. Here, we propose to extend this approach to a single-channel and multiple-sources framework, where the mixture's information can be exploited.…”

Section: Introduction (mentioning)

confidence: 99%

Phase recovery with Bregman divergences for audio source separation

Magron, Vial, Oberlin et al., 2020. Preprint. Self-citation.

Time-frequency audio source separation is usually achieved by estimating the short-time Fourier transform (STFT) magnitude of each source, and then applying a phase recovery algorithm to retrieve time-domain signals. In particular, the multiple input spectrogram inversion (MISI) algorithm has shown good performance in several recent works. This algorithm minimizes a quadratic reconstruction error between magnitude spectrograms. However, this loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. In this paper, we propose to reformulate phase recovery in audio source separation as a minimization problem involving Bregman divergences. To optimize the resulting objective, we derive a projected gradient descent algorithm. Experiments conducted on a speech enhancement task show that this approach outperforms MISI for several alternative losses, which highlights their relevance for audio source separation applications.
