2011
DOI: 10.1109/tasl.2010.2073704
|View full text |Cite
|
Sign up to set email alerts
|

Time-Varying Autoregressions in Speech: Detection Theory and Applications

Abstract: This article develops a general detection theory for speech analysis based on time-varying autoregressive models, which themselves generalize the classical linear predictive speech analysis framework. This theory leads to a computationally efficient decision-theoretic procedure that may be applied to detect the presence of vocal tract variation in speech waveform data. A corresponding generalized likelihood ratio test is derived and studied both empirically for short data records, using formant-like synthetic … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
25
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(25 citation statements)
references
References 47 publications
0
25
0
Order By: Relevance
“…In addition, it is noted that all the linear predictive analyses involved in the present study are classical in the sense that they use time-invariant filter coefficients that are updated once per frame. A more flexible paradigm is to utilize time-varying AR-modeling (e.g., Schnell and Lacroix, 2008;Rudoy et al, 2011) in which linear predictive filter coefficients evolve in time. Combining the proposed WLP-AME method with the time-varying AR modeling approach is another topic of future studies which would maybe help in detecting vocal tract variation in continuous high-pitched speech.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, it is noted that all the linear predictive analyses involved in the present study are classical in the sense that they use time-invariant filter coefficients that are updated once per frame. A more flexible paradigm is to utilize time-varying AR-modeling (e.g., Schnell and Lacroix, 2008;Rudoy et al, 2011) in which linear predictive filter coefficients evolve in time. Combining the proposed WLP-AME method with the time-varying AR modeling approach is another topic of future studies which would maybe help in detecting vocal tract variation in continuous high-pitched speech.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, we propose to modify the 2DAR model by replacing TDLP with time-varying linear prediction (TVLP) [11,12] which is a generalization of conventional linear prediction (LP) [13]. TVLP can be used to analyze non-stationarity of speech signals by allowing the underlying all-pole model to be time-varying.…”
Section: Introductionmentioning
confidence: 99%
“…The other automatic segmentation group consists of algorithms which do not require any knowledge about the phonetic content and are based mostly on statistical signal analysis (e.g., Tyagi et al 2006;Almanidis et al, 2008Almanidis et al, , 2009Scharenborg et al, 2010;Rudoy et al, 2011). Tyagi et al (2006) employed speech signal modelling based on the autoregressive process.…”
Section: Introductionmentioning
confidence: 99%
“…In the work by Scharenborg et al (2010), the first step is also MFCC parameterisation, followed by segmentation based on the principle of finding the maximum distances between observation vectors in the selected subset, performed with the use of a method known as Maximum Margin Clustering (MMC). Finally, Rudoy et al (2011) used stochastic modelling employing the standard AutoRegressive (AR) and Time-Varying AutoRegressive (TVAR) models. Detection is performed with the use of a classic GLRT test based on determining which model is more adequate for a particular segment of the signal.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation