2014
DOI: 10.1007/s40313-014-0148-0
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent Genetic Fuzzy Inference System for Speech Recognition: An Approach from Low Order Feature Based on Discrete Cosine Transform

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…The speech signal processing aims to efficiently and accurately transform the acoustic speech signal for use in automatic systems. The extensive development of speech processing research demonstrates the effort to improve the performance of speech recognition systems for practical applications (Bellegarda and Monz 2016;Silva and Serra 2014). The use of such systems allows autonomy in areas as telephony, in which service requests are directed by voice commands (Cardoso et al 2010); in automotive engineering, by driving devices inside the cars (Qian et al 2009;Hua and Ng 2010;Li et al 2013); in computer systems, through computer utility programs, in addition to robotic application (Koo et al 2014) and in residential and hospital automation for accessibility of people with locomotive and visual disabilities (Gnanasekar et al 2012;Singh and Yadav 2015).…”
Section: Motivation and Justificationmentioning
confidence: 99%
See 1 more Smart Citation
“…The speech signal processing aims to efficiently and accurately transform the acoustic speech signal for use in automatic systems. The extensive development of speech processing research demonstrates the effort to improve the performance of speech recognition systems for practical applications (Bellegarda and Monz 2016;Silva and Serra 2014). The use of such systems allows autonomy in areas as telephony, in which service requests are directed by voice commands (Cardoso et al 2010); in automotive engineering, by driving devices inside the cars (Qian et al 2009;Hua and Ng 2010;Li et al 2013); in computer systems, through computer utility programs, in addition to robotic application (Koo et al 2014) and in residential and hospital automation for accessibility of people with locomotive and visual disabilities (Gnanasekar et al 2012;Singh and Yadav 2015).…”
Section: Motivation and Justificationmentioning
confidence: 99%
“…Finally, the representative coefficients of each pattern were encoded in a two-dimensional time matrix by the application of the discrete cosine transform (DCT). The DCT two-dimensional time matrices C jm kn encode the patterns of the speech commands, reproducing the local and global variations of the spectral envelope of the signals as well as presenting the local and global variations of the signal in the time domain (Silva and Serra 2014;Cao et al 2015).…”
Section: Multilevel Hierarchy Speech Pattern Recognition System Methomentioning
confidence: 99%
“…This difficulty increases when the number of estimates in a multiclass problem must be defined simultaneously with high accuracy, since the boundaries among different classes may not be well defined. Thus, new methodologies are proposed to obtain more robust results in multiclass tasks [7,18].…”
Section: Multiclass Learningmentioning
confidence: 99%
“…For the structure of the LVQ neural network, it was necessary to define the η learning rate and the n number of neurons of the competitive layer. The defined values in η set are often used in the specialized literature [17,18,26] and the n set was specified considering that the number of neurons in hidden layer should be greater than the number of inputs and greater than the number of neural network outputs. Because the vectors C N Jm , where N = {4, 9, 16} are mapped into a 30-dimensional space, the input of 15 LVQ experts is a set with 30 source nodes.…”
Section: Lvq Expertsmentioning
confidence: 99%
“…The term frame is used to determine the length of time between successive calculations of parameters. For speech processing, normally, the time frame is between 10ms and 30ms [14], [15].…”
Section: Pre-processing Of Speech Signalmentioning
confidence: 99%