2018
DOI: 10.1109/taslp.2018.2830116
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation

Abstract: One of the most general models of music signals considers that such signals can be represented as a sum of two distinct components: a tonal part that is sparse in frequency and temporally stable, and a transient (or percussive) part composed of short term broadband sounds. In this paper, we propose a novel hybrid method built upon Nonnegative Matrix Factorisation (NMF) that decomposes the time frequency representation of an audio signal into such two components. The tonal part is estimated by a sparse and orth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 36 publications
0
6
0
Order By: Relevance
“…This algorithm relies on the anisotropic continuity of the spectrogram to separate the signal. Since the shock spectrum is continuously and smoothly distributed in frequency, the harmonic spectrum is continuously and smoothly distributed in the time direction [ 17 ]. Equation ( 1 ) is derived from the differences in the spectral representation of impact and harmonic sounds.…”
Section: Methodsmentioning
confidence: 99%
“…This algorithm relies on the anisotropic continuity of the spectrogram to separate the signal. Since the shock spectrum is continuously and smoothly distributed in frequency, the harmonic spectrum is continuously and smoothly distributed in the time direction [ 17 ]. Equation ( 1 ) is derived from the differences in the spectral representation of impact and harmonic sounds.…”
Section: Methodsmentioning
confidence: 99%
“…( 12), the proposed method directly treats the timedomain signals, and its constraint claims that the separated components satisfy the perfect reconstruction property in the time-domain as in a recent audio source separation method [24]. Some of the conventional HPSS methods (e.g., anisotropic smoothness based methods [5,7] and non-negative matrix factorization based methods [25,26]) assume additivity of power spectrograms, but it requires some statistical assumptions as discussed in [20]. In contrast, the constraint in the proposed method (additivity in the time-domain) is always justified.…”
Section: Relation To the Conventional Methodsmentioning
confidence: 99%
“…In the three-layer wavelet decomposition, the noise energy is mainly distributed on W h (1, k)and W h (2, k), and the signal energy is mainly distributed on W l (2, k). Combining (1), (5), and (6), we can obtain…”
Section: An Example Of Proposed Algorithm Based On Wavelet Transformmentioning
confidence: 99%
“…Many new technologies are introduced into BSS research. For example, signal sparse component analysis [2,3], dictionary learning [4,5], nonnegative matrix factorization [6,7], bounded component analysis [8,9], tensor decomposition [10,11], and machine learning [12]. However, these algorithms are sensitive to noise.…”
Section: Introductionmentioning
confidence: 99%