2009
DOI: 10.1109/tasl.2008.2009162
|View full text |Cite
|
Sign up to set email alerts
|

Robust Detection of Phone Boundaries Using Model Selection Criteria With Few Observations

Abstract: Abstract-Automatic phone segmentation techniques based on model selection criteria are studied. We investigate the phone boundary detection efficiency of entropy-and Bayesian-based model selection criteria in continuous speech based on the DISTBIC hybrid segmentation algorithm. DISTBIC is a text-independent bottom-up approach that identifies sequential model changes by combining metric distances with statistical hypothesis testing. Using robust statistics and small sample corrections in the baseline DISTBIC al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
13
0
2

Year Published

2013
2013
2016
2016

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(15 citation statements)
references
References 34 publications
0
13
0
2
Order By: Relevance
“…The most common approach is based on the segmentation on the basis of changing of spectral variation function [2,31,32]. In the most available descriptions of the speech segmentation algorithms only own modality of the speech signal is used.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The most common approach is based on the segmentation on the basis of changing of spectral variation function [2,31,32]. In the most available descriptions of the speech segmentation algorithms only own modality of the speech signal is used.…”
Section: Introductionmentioning
confidence: 99%
“…The quality of the speech synthesis and recognition results depends considerably on its qualitative solution. Di erent algorithms for speech signal automatic segmentation are already available now [1][2][3][31][32][33][34]. Many available algorithms have their certain advantages and disadvantages.…”
Section: Introductionmentioning
confidence: 99%
“…These variances are a basis for formulating a Generalized Likelihood Ratio Test (GLRT) related to a change in the signal power spectrum density over the analysed period of time. Almanidis et al (2008;2009) presented a hybrid algorithm known as the Model Selection Criterion (MSC), using a Bayesian Information Criterion (BIC) and Kullback-Leibler information. This method requires MFCC parameterisation of the speech signal, constructing a model of speech signal segments and performing boundary detection.…”
Section: Introductionmentioning
confidence: 99%
“…The efficiency of defining interphonemic transitions by means of supervised algorithms, with the assumed tolerance of up to 20 ms, reaches 93%, while for unsupervised algorithms it is up to 75%, with the false acceptance error of 2% (Almanidis et al, 2008;2009). This advantage of supervised algorithms is a consequence of using information about the phonetic content.…”
Section: Introductionmentioning
confidence: 99%
“…Some papers also propose the text-independent phonetic segmentation as in [8,9], which detect phone boundaries only using acoustic features. However, our main target applications are those in which linguistic information is available, such as TTS or accent conversion.…”
Section: Introductionmentioning
confidence: 99%