2015
DOI: 10.1121/1.4923156
|View full text |Cite
|
Sign up to set email alerts
|

Underdetermined reverberant acoustic source separation using weighted full-rank nonnegative tensor models

Abstract: In this paper, a fusion of K models of full-rank weighted nonnegative tensor factor two-dimensional deconvolution (K-wNTF2D) is proposed to separate the acoustic sources that have been mixed in an underdetermined reverberant environment. The model is adapted in an unsupervised manner under the hybrid framework of the generalized expectation maximization and multiplicative update algorithms. The derivation of the algorithm and the development of proposed full-rank K-wNTF2D will be shown. The algorithm also enco… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…Y.-H. Yang [29] employed dictionary learning methods to estimate the subspace structures of musical sources and introduced a new approach called multiple low-rank representations (MLRR) that is used for decomposition by the learnt dictionaries. Al Tmem et al [2,3,22,27] proposed an algorithm based on the factorization method with a hybrid framework that combined both the multiplicative update and the Expectation-Maximization algorithms. Wang et al [26] learned the ideal binary mask using deep neural networks, and source separation issues were regarded as binary classification problems.…”
Section: Relation To Previous Workmentioning
confidence: 99%
“…Y.-H. Yang [29] employed dictionary learning methods to estimate the subspace structures of musical sources and introduced a new approach called multiple low-rank representations (MLRR) that is used for decomposition by the learnt dictionaries. Al Tmem et al [2,3,22,27] proposed an algorithm based on the factorization method with a hybrid framework that combined both the multiplicative update and the Expectation-Maximization algorithms. Wang et al [26] learned the ideal binary mask using deep neural networks, and source separation issues were regarded as binary classification problems.…”
Section: Relation To Previous Workmentioning
confidence: 99%
“…Early works for speech separation solely utilized the audio signal [3][4][5][6][7]. The audio only speech separation problem is inherently challenging due to its ambiguity, making it difficult to achieve satisfactory outcomes without additional information, for example, prior knowledge or certain microphone configuration.…”
Section: Introductionmentioning
confidence: 99%
“…The process of distinguishing particular audio sources from a mixture of audio signals using visual indicators as further information is known as audio-visual source separation. This method differs from conventional ones that exclusively rely on the audio stream for source separation [1][2][3][4][5]. In other words, it is a technique that utilizes both auditory and visual information to separate individual sound sources from a mixed audio signal.…”
Section: Introductionmentioning
confidence: 99%