ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1987.1169628
|View full text |Cite
|
Sign up to set email alerts
|

On the automatic segmentation of speech signals

Abstract: For large vocabulary and continuous speech recognition, the subword-unit-based approach is a viable alternative to the wholeword-unit-based approach. For preparing a large inventory of subword units, an automatic segmentation is preferrable to manual segmentation as it substantially reduces the work associated with the generation of templates and gives more consistent results. In this paper we discuss some methods for automatically segmenting speech into phonetic units. Three different approaches are described… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
45
0
8

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 105 publications
(54 citation statements)
references
References 9 publications
1
45
0
8
Order By: Relevance
“…We would have expected better segmentation performance using WED or MD as they give a more accurate measure of acoustic homogeneity within a segment, by vime of modeling the vectors within a segment by a multi--ate Gaussian. The D(rn,T) (3) for La, WED and MD show that this expectation is valid in terms of the distortion associated with the ML segmentation, i.e., WED and MD show five times lower error in terms ofthe overall distortion obtained, though not offering improved segment match with manual segmentation. However, the poorer segmentation performance of WED and MD may be attributed to poor covariance matrix estimation with the limited data in each segment or that the vector elements themselves are noisy.…”
Section: Segmentation Evaluationmentioning
confidence: 90%
See 1 more Smart Citation
“…We would have expected better segmentation performance using WED or MD as they give a more accurate measure of acoustic homogeneity within a segment, by vime of modeling the vectors within a segment by a multi--ate Gaussian. The D(rn,T) (3) for La, WED and MD show that this expectation is valid in terms of the distortion associated with the ML segmentation, i.e., WED and MD show five times lower error in terms ofthe overall distortion obtained, though not offering improved segment match with manual segmentation. However, the poorer segmentation performance of WED and MD may be attributed to poor covariance matrix estimation with the limited data in each segment or that the vector elements themselves are noisy.…”
Section: Segmentation Evaluationmentioning
confidence: 90%
“…Our facus, in this paper, is on two basic segmentation techniques suitable for ASWU systems, namely, i) spectral transition measure (STM) based segmentation and, ii) maximum likelihood (ML) segmentation [3]. Of these, while STM offers simplicity, the ML segmentation has gained sufficient acceptance in practically most ASWU based speech recognition systems reported so far [I].…”
Section: Introductionmentioning
confidence: 99%
“…Apresenta como principais vantagens a rapidez e o fato de os erros serem previsíveis e sistemáticos, podendo ser, portanto, reduzidos. Esse sistema de segmentação é amplamente utilizado para a segmentação fonética da fala em unidades menores do que a palavra (SVENDSEN;SOONG, 1987;van HEMERT, 1991;BARBOSA, 2006), mas é ainda muito pouco aplicado para segmentar unidades maiores do que a palavra, ou seja, enunciados e unidades tonais.…”
Section: Relação Entre Pausas E Quebras Prosódicasunclassified
“…The method cuts the actions into segments such that the global distortion of these segments w.r.t their means is minimized. This can be formulated as a dynamic programming problem [23].…”
Section: Temporal Segmentationmentioning
confidence: 99%