2019
DOI: 10.1007/s00034-019-01157-3
|View full text |Cite
|
Sign up to set email alerts
|

A Primer on Deep Learning Architectures and Applications in Speech Processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(11 citation statements)
references
References 55 publications
0
11
0
Order By: Relevance
“…Recurrent neural networks (RNNs) are a group of DL models employed in speech recognition (Ogunfunmi et al, 2019 ), natural language processing (Deng and Liu, 2018 ), and biomedical signal processing (Vicnesh et al, 2020 ; Baygin et al, 2021 ). CNN models are of Feed-Forward types.…”
Section: Methodsmentioning
confidence: 99%
“…Recurrent neural networks (RNNs) are a group of DL models employed in speech recognition (Ogunfunmi et al, 2019 ), natural language processing (Deng and Liu, 2018 ), and biomedical signal processing (Vicnesh et al, 2020 ; Baygin et al, 2021 ). CNN models are of Feed-Forward types.…”
Section: Methodsmentioning
confidence: 99%
“…Deep learning has been applied to various aspects of speech processing, speech recognition, speech and speaker identification and such applications [ 111 ]. In this subsection, we focus on speech source separation applications using variational autoencoders.…”
Section: Applicationsmentioning
confidence: 99%
“…The spectrogram is found by taking the square of the magnitude of the STFT especially for deep learning algorithms involving speech. Typically, in practice, the spectrogram is often normalized before being fed into a neural network [ 111 ]. Alternative inputs include log spectrograms and mel spectrograms.…”
Section: Applicationsmentioning
confidence: 99%
“…The more representative is the deep learning method. It has been recently found that deep learning [17,18] has achieved remarkable success in many speech processing fields with its excellent learning performance. The representative technology is DNN-HMM hybrid structure [19,20], replacing the conventional acoustic modeling based on GMM and HMM.…”
Section: Introductionmentioning
confidence: 99%