2021
DOI: 10.1016/j.sigpro.2020.107753
|View full text |Cite
|
Sign up to set email alerts
|

Independent deeply learned matrix analysis with automatic selection of stable microphone-wise update and fast sourcewise update of demixing matrix

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 29 publications
0
6
0
Order By: Relevance
“…We explain that the problem of minimizing L with respect to W i when R jn is kept fixed can be solved by an algorithm called vectorwise coordinate descent (VCD) [15], which has been developed for the conventional IPSDTA [17]. VCD is a block coordinate descent method that cyclically update w in for each i = 1, .…”
Section: A Optimization Of Demixing Matrix W Imentioning
confidence: 99%
See 1 more Smart Citation
“…We explain that the problem of minimizing L with respect to W i when R jn is kept fixed can be solved by an algorithm called vectorwise coordinate descent (VCD) [15], which has been developed for the conventional IPSDTA [17]. VCD is a block coordinate descent method that cyclically update w in for each i = 1, .…”
Section: A Optimization Of Demixing Matrix W Imentioning
confidence: 99%
“…As a supervised extension of ILRMA as well as FDICA, independent deeply learned matrix analysis (IDLMA) [14], [15] has been proposed. Instead of using NMF, in IDLMA, a pretrained deep neural network (DNN) is used to model a source power spectrum.…”
Section: Introductionmentioning
confidence: 99%
“…More recently, deep neural network (DNN) is utilized to model the source spectral characteristics [13,14,15,16,17,18] given its powerful modeling ability. In [13,14,15,16], the supervised learning of the source spectrogram is presented, which is further combined with the ILRMA-based blind estimation of the demixing matrix and is referred as independent deeply learned matrix analysis (IDLMA). Due to the complexity of directly estimating the source spectrogram, the DNN-based supervised update of the source variance is then incorporated into the framework of IVA [17].…”
Section: Introductionmentioning
confidence: 99%
“…Taking advantage of the iterative source steering updates, the work in [18] proposes to determine the source model parameters by backing propagate the permutation invariant separation loss through multiple iterations of IVA. These DNN models [13,14,15,16,17,18] can accurately estimate the parameters of the given source model. However, they need to update the source model parameters with DNN iteratively, leading to the expensive computational cost and the mismatch between training and testing data.…”
Section: Introductionmentioning
confidence: 99%
“…We aimed to reduce bleeding sound in a fully blind manner, namely, the spatial locations of sources and microphones are unknown. We also did not use supervision of sources, such as solo-played music datasets, to avoid the mismatch between training and test data; thus, supervised deep-neural-networkbased approaches [19], [20], [21], [22], [23] are out of the scope of this paper. We propose a phase-insensitive method for blind bleeding-sound reduction, which is a modification of TCNMF: we introduce an a priori generative model for diagonal and off-diagonal elements of the frequency-wise mixing matrix to model relative leakage levels of bleeding sounds.…”
Section: Introductionmentioning
confidence: 99%