Interspeech 2015 2015
DOI: 10.21437/interspeech.2015-217
|View full text |Cite
|
Sign up to set email alerts
|

Layered nonnegative matrix factorization for speech separation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…We executed our proposed LCNMF algorithm by comparing both TNMF and NMF. For LNMF and LCNMF, we both chose a two-layer structure, LNMF with more layers did not significantly improve the separation results [18]. The number of speech basis vectors K1 and K2 of the first and second layers was set as 120 and 30 respectively.…”
Section: Experimental Data and Parametersmentioning
confidence: 99%
See 1 more Smart Citation
“…We executed our proposed LCNMF algorithm by comparing both TNMF and NMF. For LNMF and LCNMF, we both chose a two-layer structure, LNMF with more layers did not significantly improve the separation results [18]. The number of speech basis vectors K1 and K2 of the first and second layers was set as 120 and 30 respectively.…”
Section: Experimental Data and Parametersmentioning
confidence: 99%
“…Roux et al proposed a Deep NMF model, which unfolds the NMF iterations and unties the NMF parameters for optimal separation performance [17]. Layered NMF (LNMF) stacks multiple layers of standard NMF blocks, and then combine sparse parts-based representation learned by each single-layer to interpret the speech data differently [18]. The experimental results showed that the hierarchical NMF model could significantly enhance speech separation performance [17][18].…”
Section: Introductionmentioning
confidence: 99%
“…Hsu et al [109] apply deep MF on the spectrogram matrix of a set of spoken sentences to extract several layers of frequential basis features, and is better able to separate the speakers in a mixture than a simple one-layer NMF. Thakur et al [110] used deep AA to extract sources based on the spectrograms of bioacoustics signals, with the dictionaries learnt at the first layers corresponding to archetypes on the convex hull of the data while deeper atoms being more in the center of the data.…”
Section: Audio Processingmentioning
confidence: 99%

Deep matrix factorizations

De Handschutter,
Gillis,
Siebert
2020
Preprint