2019
DOI: 10.1109/jstsp.2019.2913965
|View full text |Cite
|
Sign up to set email alerts
|

Modulation Filter Learning Using Deep Variational Networks for Robust Speech Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 33 publications
(16 citation statements)
references
References 37 publications
0
16
0
Order By: Relevance
“…During the last decade, deep learning has demonstrated to be an excellent technique in the field of AI. Deep learning methods have been used to solve various problems like image processing [27,28], speech recognition [29,30], and natural language processing [31,32]. As deep learning can learn robust and effective feature representation through layer-by-layer feature transformation of the original signal automatically, it has a good capability to cope with some challenges in the field of self-driving cars.…”
Section: Theoretical Background Of Deep Learning Methods Used For Selmentioning
confidence: 99%
“…During the last decade, deep learning has demonstrated to be an excellent technique in the field of AI. Deep learning methods have been used to solve various problems like image processing [27,28], speech recognition [29,30], and natural language processing [31,32]. As deep learning can learn robust and effective feature representation through layer-by-layer feature transformation of the original signal automatically, it has a good capability to cope with some challenges in the field of self-driving cars.…”
Section: Theoretical Background Of Deep Learning Methods Used For Selmentioning
confidence: 99%
“…When the output encounters the token <eos>, it indicates the end of decoding. The decoding process is performed with the beam search strategy [34]. In other words, following each output step, the Speller only keeps the top N paths with the highest probability among all decoding paths.…”
Section: B Decoding Strategymentioning
confidence: 99%
“…C ONVOLUTIONAL neural networks (CNNs) [1] are among the most effective machine learning approaches for processing structured, high-dimensional data such as voice recordings, images, and videos and have become indispensable in, e.g., speech recognition [2], [3], audio processing [4], and image classification [5].…”
Section: Introductionmentioning
confidence: 99%