2020
DOI: 10.1155/2020/9452976
|View full text |Cite
|
Sign up to set email alerts
|

Nonlinear Dynamic Feature Extraction Based on Phase Space Reconstruction for the Classification of Speech and Emotion

Abstract: Due to the shortcomings of linear feature parameters in speech signals, and the limitations of existing time- and frequency-domain attribute features in characterizing the integrity of the speech information, in this paper, we propose a nonlinear method for feature extraction based on the phase space reconstruction (PSR) theory. First, the speech signal was analyzed using a nonlinear dynamic model. Then, the model was used to reconstruct a one-dimensional time speech signal. Finally, nonlinear dynamic (NLD) fe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 24 publications
0
3
0
Order By: Relevance
“…. , t n 􏼈 􏼉 represents the time series of the distribution of each data resource in a resource, F s represents the result of phase space reconstruction of each data resource structure, and ε represents the embedded dimension, the multidimensional state space of data resource structure is established based on the distribution point Q X of each data resource in high-dimensional space obtained in Section 3.4 and integrated into the theory of chaotic phase space reconstruction [35,36]:…”
Section: Optimization Of Distance English Teaching Resourcesmentioning
confidence: 99%
“…. , t n 􏼈 􏼉 represents the time series of the distribution of each data resource in a resource, F s represents the result of phase space reconstruction of each data resource structure, and ε represents the embedded dimension, the multidimensional state space of data resource structure is established based on the distribution point Q X of each data resource in high-dimensional space obtained in Section 3.4 and integrated into the theory of chaotic phase space reconstruction [35,36]:…”
Section: Optimization Of Distance English Teaching Resourcesmentioning
confidence: 99%
“…From the experiments so far (Tables 2, 3, 4,5,6,7,8,9,10,11), the evidence for RP embeddings being a standalone, nonlinear feature for speaker modeling is strong, and on par with the linear features (like MFCC). The improvement in performance in standalone RP based systems as the modes are combined, for unimodal, bimodal and trimodal systems is depicted in Fig.…”
Section: Resultsmentioning
confidence: 95%
“…These features however do not directly capture the non-linear dynamics of the speech signal, as they focus on extracting features related to the spectral envelop of the speech signal and inherently ignore the phase relationship between the frequency components 5 . Focusing on the piecewise linearity of the speech signal to extract features ignores crucial information on the non-linear dynamics, making the information being extracted incomplete 6 .…”
Section: Features For Speaker Recognitionmentioning
confidence: 99%