2014
DOI: 10.1186/s13636-014-0026-5
|View full text |Cite
|
Sign up to set email alerts
|

Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints

Abstract: In this paper, unsupervised learning is used to separate percussive and harmonic sounds from monaural non-vocal polyphonic signals. Our algorithm is based on a modified non-negative matrix factorization (NMF) procedure that no labeled data is required to distinguish between percussive and harmonic bases because information from percussive and harmonic sounds is integrated into the decomposition process. NMF is performed in this process by assuming that harmonic sounds exhibit spectral sparseness (narrowband so… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
38
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(40 citation statements)
references
References 18 publications
1
38
1
Order By: Relevance
“…For example, it is shown in [6] that enforcing temporal smoothness improves the physical meaning of the decomposition. Similarly in [7], Canadas & al. used four constraints in order to achieve a specific harmonic/percussive decomposition.…”
Section: Introductionmentioning
confidence: 87%
“…For example, it is shown in [6] that enforcing temporal smoothness improves the physical meaning of the decomposition. Similarly in [7], Canadas & al. used four constraints in order to achieve a specific harmonic/percussive decomposition.…”
Section: Introductionmentioning
confidence: 87%
“…For example, uniformly spaced subbands on the equivalent rectangular bandwidth scale are assumed in [57]. In this work, a resolution of 1/4 of a semitone in frequency is used as in [37,67]. The time-frequency representation is obtained using 8192-point STFT and integrating the frequency bins corresponding to the same 1/4 semitone interval.…”
Section: Methodsmentioning
confidence: 99%
“…NMF [33] is an unsupervised factorization technique used for linear representation of two-dimensional nonnegative data that has been successfully applied to the decomposition of audio spectrograms [34][35][36][37][38]. In the context of audio signal processing, the popularity of this technique is related to its ability to obtain parts-based representation of the most representative objects (e.g., notes and chords) by imposing non-negative constraints that allow only additive, not subtractive, combinations of the input data unlike many other linear representations such as ICA [39] and principal component analysis (PCA) [40].…”
Section: Nmf For Source Separationmentioning
confidence: 99%
“…Alternatively, models based on non-negative matrix factorization (NMF) [10] have been applied to this task. Specific NMF methods for the task of HPSS include the use of constraints such as sparsity of percussive sources along the direction of time [11], structured factorization models that take into account the quasi-stationarity of harmonic sources [12] and extensions of NMF that account for the non-stationarity of percussive components [13,14].…”
Section: Introductionmentioning
confidence: 99%