2006
DOI: 10.1016/j.specom.2004.12.002
|View full text |Cite
|
Sign up to set email alerts
|

Sub-banded reconstructed phase spaces for speech recognition

Abstract: ii Preface A novel method for classification of speech phonemes, based on the combination of dynamical systems theory and filter banks, is introduced. The benefit of this approach is seen in its ability to model nonlinear characteristics of speech, something that traditional methods cannot do. The modeling tool that provides this capability is the reconstructed phase space. This space carries all the dynamical information present in the signal's underlying system. The reconstructed phase spaces used for modeli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2006
2006
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 43 publications
0
7
0
Order By: Relevance
“…We have shown in previous work that even when these assumptions are not met RPSs contain important information for classifying a signal [9]. Here these signals are the 12 lead ECGs, and the classes are infarcted or non-infarcted.…”
Section: Rps/gmm Approachmentioning
confidence: 98%
See 1 more Smart Citation
“…We have shown in previous work that even when these assumptions are not met RPSs contain important information for classifying a signal [9]. Here these signals are the 12 lead ECGs, and the classes are infarcted or non-infarcted.…”
Section: Rps/gmm Approachmentioning
confidence: 98%
“…GMMs are a set of Gaussian probability density functions used to characterize the distribution of an underlying set of data. They are widely used in engineering applications, especially speech [9]. The equation that defines a GMM is [2]:…”
Section: Rps/gmm Approachmentioning
confidence: 99%
“…In [17], reconstructed phase spaces built from speech signals that have been subband filtered were used for isolated phoneme classification, showing improved recognition accuracies over fullband signal phase space reconstruction features. However, this approach is infeasible for continuous speech recognition because of its high computational complexity.…”
Section: Nonlinear Features For Speech Recognitionmentioning
confidence: 99%
“…Unfortunately, such a model cannot convey nonlinear 3D fluid dynamics phenomena of speech [20,21]. In order to fill the existing gap between this ideal linear deterministic model and real strongly unpredictable speech production process, non-linear processing techniques can be used [22,23,24,25].In recent years, reconstructed phase space (RPS) of speech has been used for speech recognition [26,27], speech enhancement [26,27]and detecting sleepiness [30]. Moreover, nonlinear dynamics features extracted from RPS of speech has been employed in SER.It has been shown that geometrical properties of RPScontain important emotional cues of speaker [24,25].…”
Section: Introductionmentioning
confidence: 99%