2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) 2015
DOI: 10.1109/isspit.2015.7394335
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional maxout neural networks for speech separation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 12 publications
0
10
0
Order By: Relevance
“…), and CNN (Hui et al. ; Nugraha et al. ), are the current state of the art in the separation of speech and music (Stöter et al.…”
Section: Monoaural Audio Source Separationmentioning
confidence: 99%
“…), and CNN (Hui et al. ; Nugraha et al. ), are the current state of the art in the separation of speech and music (Stöter et al.…”
Section: Monoaural Audio Source Separationmentioning
confidence: 99%
“…In terms of front-end models, DNNs are the most common approach [6], though RNNs have been used for speech enhancement as well, as in [23]. There have also been a few studies that used CNNs for front-end speech denoising [24,25,26]. In the last of these, the authors used a single "bypass" connection from the encoder to the decoder, but none of the models described here can be said to use residual connections.…”
Section: Prior Workmentioning
confidence: 99%
“…Instead of using a masking function, the clean speech coefficients have also been used as target of a DNN in many approaches [6], [7], [36], [39]. For this, various architecture of DNNs have been employed, e.g., feed-forward networks [6], [7], recurrent neural networks [24] including long short-term memory cells [37], [40]- [42], generative adversarial networks [43], [44], convolutional neural networks [27], [29], [45] and WaveNet based approaches [28], [46].…”
Section: Introductionmentioning
confidence: 99%