Proceedings of the 44th IEEE 2001 Midwest Symposium on Circuits and Systems. MWSCAS 2001 (Cat. No.01CH37257)
DOI: 10.1109/mwscas.2001.986241
|View full text |Cite
|
Sign up to set email alerts
|

A new text-independent method for phoneme segmentation

Abstract: A new approach for text-independent speech segmentation is proposed. The novelty consists in a preprocessing based on critical-band perceptual analysis and an original algorithm for the individuation of phoneme boundaries. The results are promising since the method gives -74% of correct segmentation without presenting over-segmentation.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
50
0
5

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 63 publications
(59 citation statements)
references
References 4 publications
4
50
0
5
Order By: Relevance
“…Using a linguistically constrained hidden Markov model (HMM) based method, they yield over 85% boundary detection rate in noisefree environments at 20msec boundary misalignment (tolerance). Aversano et al (2001) introduce a novel approach for text-independent speech segmentation where the preprocessing is based on criticalband perceptual analysis. It results in 74% segmentation accuracy while limiting over-segmentation to a minimum.…”
Section: Phonemic Segmentationmentioning
confidence: 99%
“…Using a linguistically constrained hidden Markov model (HMM) based method, they yield over 85% boundary detection rate in noisefree environments at 20msec boundary misalignment (tolerance). Aversano et al (2001) introduce a novel approach for text-independent speech segmentation where the preprocessing is based on criticalband perceptual analysis. It results in 74% segmentation accuracy while limiting over-segmentation to a minimum.…”
Section: Phonemic Segmentationmentioning
confidence: 99%
“…Aversano et al 2001;Qiao et al 2008;Goldwater et al 2009;Scharenborg et al 2010). Such traditional segmentation measures cannot be used for evaluating phone acquisition because they either do not assign segments to phones (each segment being equally unrelated to all other segments); or because they classify segments in a trivial way: two segments are assumed to be in the same class if their sequences are identical and vice versa.…”
Section: Evaluation Measures For Segmentationmentioning
confidence: 99%
“…The center frequencies of the filter bank are uniformly distributed along the Bark scale, whereas the corresponding bandwidths are defined by (2). From these, a vector of Perceptual Critical Band Features (PCBF) [40] is computed as the log-energy of the acoustic signal:…”
Section: Barkmentioning
confidence: 99%