2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7952162
|View full text |Cite
|
Sign up to set email alerts
|

Fully complex deep neural network for phase-incorporating monaural source separation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(11 citation statements)
references
References 21 publications
0
11
0
Order By: Relevance
“…The deep learning model learns the mapping or masking function to retrieve the clean complex spectrum from the noisy one, and simultaneously estimates the phase and amplitude information of the speech signal. Some studies have confirmed that complex spectral features lead to better performances than (log) PS features [63,64]. The second category suggests that a raw speech waveform can be directly enhanced without transforming it into spectral features [65][66][67][68][69][70].…”
Section: Improving the Intelligibility Of Speech For Simulated Electrmentioning
confidence: 99%
“…The deep learning model learns the mapping or masking function to retrieve the clean complex spectrum from the noisy one, and simultaneously estimates the phase and amplitude information of the speech signal. Some studies have confirmed that complex spectral features lead to better performances than (log) PS features [63,64]. The second category suggests that a raw speech waveform can be directly enhanced without transforming it into spectral features [65][66][67][68][69][70].…”
Section: Improving the Intelligibility Of Speech For Simulated Electrmentioning
confidence: 99%
“…Williamson et al proposed a twinhead DNN to infer both real and imaginary parts of the target spectrogram [24]. Several authors attempted to construct a fully complex-valued network by updating parameters based on complex back propagation [25,26]. However, to achieve good performance, the network needs to be constrained by sparsity.…”
Section: * Indicates Equal Contributionmentioning
confidence: 99%
“…In [14], methods such as Wiener filter and iterative procedure that incorporate phase constraints are discussed in singing voice separation systems. Lee et al [15] estimate the complex-valued STFT of music sources by a complex-valued deep neural network. PhaseNet [16] handles phase estimation as a classification problem.…”
Section: Introductionmentioning
confidence: 99%