[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing 1992
DOI: 10.1109/icassp.1992.225848

Phonemic HMM constrained by statistical VQ-code transition

Cited by 6 publications (6 citation statements)
References 2 publications
“…The reasons for this phenomenon that the incorporation of frame correlations caused even more errors than the baseline system can be explained in several ways. First, as is known from [12], the characteristics of frame correlations can be considered highly speaker-dependent. However, in our experiments, all the test speakers were different from the training speakers, and the frame correlations were used in speaker-independent mode.…”
Section: Experimental Results For Phoneme-independent Frame Correl…
Citation type: mentioning
confidence: 99%
“…But, it has been known that the correlation PD's obtained in such a way tend to concentrate on the correlation characteristics of the frequently observed phonemes. To compensate for this, in [12], the correlation PD's are obtained by pooling the cooccurrence counts with equal contribution of each phoneme, and they are shown to be better empirically than those obtained by simple pooling. Assume that there are phonemes, used as units for recognition.…”
Section: Frame-correlation PD
Citation type: mentioning
confidence: 99%
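
The contrast drawn in the statement above, simple pooling versus pooling with equal contribution of each phoneme, can be illustrated with a small sketch. This is not the cited paper's implementation. It assumes that "equal contribution" means scaling each phoneme's co-occurrence table to a total mass of one before summing, and that the pooled table is then row-normalized into P(current code | previous code). The function names, the two-code codebook, and the toy counts are hypothetical.

```python
def normalize_rows(table):
    """Turn each row of counts into a conditional distribution (rows with no mass stay zero)."""
    result = []
    for row in table:
        total = sum(row)
        result.append([v / total if total > 0 else 0.0 for v in row])
    return result


def simple_pooling(counts_by_phoneme):
    """Sum raw co-occurrence counts over all phonemes, then row-normalize."""
    n = len(next(iter(counts_by_phoneme.values())))
    pooled = [[0.0] * n for _ in range(n)]
    for counts in counts_by_phoneme.values():
        for i in range(n):
            for j in range(n):
                pooled[i][j] += counts[i][j]
    return normalize_rows(pooled)


def equal_contribution_pooling(counts_by_phoneme):
    """Assumed scheme: scale each phoneme's table to total mass 1 before summing."""
    n = len(next(iter(counts_by_phoneme.values())))
    pooled = [[0.0] * n for _ in range(n)]
    for counts in counts_by_phoneme.values():
        mass = sum(sum(row) for row in counts)
        if mass == 0:
            continue  # a phoneme that was never observed contributes nothing
        for i in range(n):
            for j in range(n):
                pooled[i][j] += counts[i][j] / mass
    return normalize_rows(pooled)


# Toy data (hypothetical): a frequent phoneme "a" and a rare phoneme "t",
# each with a 2x2 table counts[previous_code][current_code].
counts_by_phoneme = {
    "a": [[90, 10], [10, 90]],  # 200 frame pairs, mostly stays on the same code
    "t": [[1, 9], [9, 1]],      # 20 frame pairs, mostly switches code
}

simple = simple_pooling(counts_by_phoneme)
equal = equal_contribution_pooling(counts_by_phoneme)
print("simple pooling:", simple)
print("equal-contribution pooling:", equal)
```

With these toy counts, simple pooling is dominated by the frequent phoneme (the probability of staying on code 0 is 91/110), while the equal-contribution variant weights both phonemes alike and gives 0.5.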