2022
DOI: 10.1007/s11277-022-09640-y
|View full text |Cite
|
Sign up to set email alerts
|

Survey of Deep Learning Paradigms for Speech Processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
20
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 70 publications
(20 citation statements)
references
References 134 publications
0
20
0
Order By: Relevance
“…Also, it provides the pollution free nature of the HEV by selection of pollution free sources for the powering the HEV. In recent years, deep learning algorithms have shown noteworthy contributions in various signal processing applications because of their faster conversions, high accuracy, reliability, and effectiveness [6], [7], [12]. In the future, various deep learning-based systems can be employed for driving and vehicle condition data augmentation to create the synthetic data for the simulation using available limited datasets [8], [9], [13].…”
Section: ░ 4 Simulation Results and Discussionmentioning
confidence: 99%
“…Also, it provides the pollution free nature of the HEV by selection of pollution free sources for the powering the HEV. In recent years, deep learning algorithms have shown noteworthy contributions in various signal processing applications because of their faster conversions, high accuracy, reliability, and effectiveness [6], [7], [12]. In the future, various deep learning-based systems can be employed for driving and vehicle condition data augmentation to create the synthetic data for the simulation using available limited datasets [8], [9], [13].…”
Section: ░ 4 Simulation Results and Discussionmentioning
confidence: 99%
“…In CNN‐based speech processing, the algorithm first extracts the Mel‐frequency cepstral coefficients (MFCC) features from the original speech data and takes them as an input (Figure 15e). [ 65,139,154 ] Then, the convolutional layer extracts various features from the input through the learnable filters, that is, the kernels, to create feature maps. The pooling layer follows the convolutional layer to decrease the size of the convoluted feature map to reduce the computational costs.…”
Section: For Soft Acoustic/vibration Sensorsmentioning
confidence: 99%
“…A comprehensive review of all ML and DL techniques used for speech processing is beyond the scope of this paper, and they have been already covered in a number of existing reviews. [137][138][139] Therefore, in this section, we will primarily focus on the ML and DL algorithms which were successfully used together with soft acoustic/vibration sensors to conduct speech processing in the literature thus far. Both traditional ML algorithms (e.g., linear discriminant analysis [LDA], random forest [RF], and Gaussian mixture model [GMM]) and DL algorithms (e.g., FNN and CNN) have demonstrated excellent capability to process the signals coming from soft acoustic/vibration sensors to perform speech recognition.…”
Section: Algorithms For Soft Acoustic/vibration Sensorsmentioning
confidence: 99%
“…A comprehensive review with background introduction and formulation of speech separation and components of supervised separation, i.e., learning machines, training targets, and acoustic features, have been introduced with a description of monaural speech enhancement, speaker separation, and speech de-reverberation as well as multimicrophone techniques in [17]. The articles [17], [32], [33], [34] presented interesting reviews of deep learning applied to various problems of speech processing. Nevertheless, these review articles presented speaker separation using deep learning in the T-F domain only in a short portion of the overview.…”
Section: Introductionmentioning
confidence: 99%