11th IEEE International Conference on Control &Amp; Automation (ICCA) 2014
DOI: 10.1109/icca.2014.6871132
|View full text |Cite
|
Sign up to set email alerts
|

Cross-language transfer speech recognition using deep learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…One way to train a CNN for speech recognition is through the use of spectrograms as input images, (Li et al , 2013). Various techniques have been investigated to the end of building robust speech recognizers and/or to cater for multiple languages (Seki et al , 2017; Wu et al , 2017; Kundu et al , 2016; Chen and Mak, 2015; Zhao et al , 2014); however, in this context, a simple speech recognition was required, with the ability to categorize audio inputs as one of seven possibilities, namely, “Yes”, “No”, “Okay”, “Don’t”, “Wait”, “Stop” and Negative. The first six categories included two positive answers and four negative answers, given that the monitoring system shall ask predefined yes/no questions to the operator when in need of further clarification, and the negative category consisted of examples of background noise.…”
Section: Building the Individual Systemsmentioning
confidence: 99%
“…One way to train a CNN for speech recognition is through the use of spectrograms as input images, (Li et al , 2013). Various techniques have been investigated to the end of building robust speech recognizers and/or to cater for multiple languages (Seki et al , 2017; Wu et al , 2017; Kundu et al , 2016; Chen and Mak, 2015; Zhao et al , 2014); however, in this context, a simple speech recognition was required, with the ability to categorize audio inputs as one of seven possibilities, namely, “Yes”, “No”, “Okay”, “Don’t”, “Wait”, “Stop” and Negative. The first six categories included two positive answers and four negative answers, given that the monitoring system shall ask predefined yes/no questions to the operator when in need of further clarification, and the negative category consisted of examples of background noise.…”
Section: Building the Individual Systemsmentioning
confidence: 99%