2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7471723
|View full text |Cite
|
Sign up to set email alerts
|

Shifted and convolutive source-filter non-negative matrix factorization for monaural audio source separation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…To our knowledge, this is the first time in the literature to reveal the flaws and limitations of the excitation-filter product representation assumed in the CWT domain. Although the NMF-based model presented in [18] also uses the excitation-filter product representation in the CWT domain, we did not consider this model for comparison since it is for supervised settings.…”
Section: B Source-filter Model Representation In Cwt Domainmentioning
confidence: 99%
See 1 more Smart Citation
“…To our knowledge, this is the first time in the literature to reveal the flaws and limitations of the excitation-filter product representation assumed in the CWT domain. Although the NMF-based model presented in [18] also uses the excitation-filter product representation in the CWT domain, we did not consider this model for comparison since it is for supervised settings.…”
Section: B Source-filter Model Representation In Cwt Domainmentioning
confidence: 99%
“…Several attempts have already been made to incorporate the source-filter model into NMF to enhance the performance of audio source separation and multipitch analysis [11]- [18]. In these studies, a spectrum of each source is simply represented as the product of excitation and filter spectra.…”
Section: Introductionmentioning
confidence: 99%
“…Though it is more complicated, 2DNMF is a more compact representation than 1D convolutional NMF (in time only), in which it is necessary to store a different template for each pitch shift of each instrument. We note that pitch shifts in real instruments are more complicated than shifting all frequency bins by the same perceptual amount [14], but the basic version of 2DNMF is fine for our purposes.…”
Section: Nmf2d For Joint Blind Factorization / Filteringmentioning
confidence: 99%
“…The work of 2-D deconvolution NMF, Kirbiz and Gunsel [72] proposed to improve the perceptual qualities of separated STFT sources by means of a defined perceptual evaluation of audio quality (PEAQ) auditory model [73]. Based on the source-filter theory, Tomohiko and Hirokazu [74] describe the spectrogram of a mixture signal as the sum of the products between the shifted copies of excitation spectrum templates and filter spectrum templates to reduce the separation error caused by using a shifted copy of a spectrum template to represent the spectra of different fundamental frequency F 0 s. They developed the shifted NMF model in the form of…”
Section: Shift Constraints On Wmentioning
confidence: 99%
“…The multiplicative update is not the only approach in solving the iterative optimization problem of NMF; the auxiliary-function method [85] is an alternative method adopted in work [64,74,86,87]. Meanwhile, the distance function D(•) is extended into Kullback-Leibler divergence [88][89][90], Itakura-Saito divergence [91], the developing β-divergence [92][93][94], α-β-divergence [95] and α-β-γ-divergence [96].…”
Section: Auxiliary Function Iterative Algorithmmentioning
confidence: 99%