2007
DOI: 10.1109/tasl.2006.885928
|View full text |Cite
|
Sign up to set email alerts
|

Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
3
2
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(18 citation statements)
references
References 27 publications
0
18
0
Order By: Relevance
“…This model was first used for cepstral modeling in [28], [35]. Then it was applied to the LT modeling of harmonic parameters in [18], [19] and LP parameters in [20]. The DCM is defined as follows:X…”
Section: B Discrete Cosine Model (Dcm) For the Lt-modeling Of The Stmentioning
confidence: 99%
See 1 more Smart Citation
“…This model was first used for cepstral modeling in [28], [35]. Then it was applied to the LT modeling of harmonic parameters in [18], [19] and LP parameters in [20]. The DCM is defined as follows:X…”
Section: B Discrete Cosine Model (Dcm) For the Lt-modeling Of The Stmentioning
confidence: 99%
“…At the same time, it was proposed in [18] to model the LT trajectory of sinusoidal speech parameters (the phase and the amplitude of each harmonic) with a Discrete Cosine Model (DCM). In contrast to [17], where the length of parameter trajectories and the order of the model were fixed, in [18] the long-term frames are continuously voiced sections of speech, which exhibit very variable size and "shape": such a section can contain several phonemes or syllables. Therefore, the LT-DCM is adjusted to F. Ben Ali and S. Djaziri-Larbi are with Université Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis, Signals and Systems Lab, Tunis, Tunisia.…”
Section: Introductionmentioning
confidence: 99%
“…The secret key is generated using the indices of VQ corresponding to the neighboring frames derived from the natural speech's characters. (Girin, 2007) presents an encryption algorithm based on time-trajectory model of the sinusoidal components corresponding to voiced speech signals. This method uses the amplitude and phase parameters of the discrete cosine functions which are applied for each voiced segment of the speech.…”
Section: Related Workmentioning
confidence: 99%
“…This is because in the proposed dictionary, only voiced speech frames can be sparsely represented and thus the deviation between the V/NV metric computed for voiced and nonvoiced frames is generally large. In order to have fast implementation, DCT can be preferred, but in this case the information of the speech source (sparse vector) and speech production system (dictionary) is coupled (Girin et al, 2007). This coupling produces ambiguous results for characterizing voiced sounds in certain cases, which results in a lower V/NV accuracy.…”
Section: Robustness Of Warped-lp Dictionary For V/nv Detectionmentioning
confidence: 99%