1996
DOI: 10.2172/431136
|View full text |Cite
|
Sign up to set email alerts
|

Improving on hidden Markov models: An articulatorily constrained, maximum likelihood approach to speech recognition and speech coding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

1997
1997
2019
2019

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…It was observed that smoothing the estimated articulatory trajectories improved estimation quality and the correlation and reduced the RMSE. This is a direct consequence of the observation made in [52], which claimed that articulatory motions are predominantly low pass in nature with a cutoff frequency of 15 Hz. This led us to introduce a Kalman smoother-based postprocessor in the architectures discussed above.…”
Section: Machine Learning Approaches For Speech Inversionmentioning
confidence: 55%
See 1 more Smart Citation
“…It was observed that smoothing the estimated articulatory trajectories improved estimation quality and the correlation and reduced the RMSE. This is a direct consequence of the observation made in [52], which claimed that articulatory motions are predominantly low pass in nature with a cutoff frequency of 15 Hz. This led us to introduce a Kalman smoother-based postprocessor in the architectures discussed above.…”
Section: Machine Learning Approaches For Speech Inversionmentioning
confidence: 55%
“…Human articulator movements are predominantly low pass in nature [52] and the articulatory trajectories usually have a smoother path, defined by one that does not have any Fourier components over the cutoff frequency of 15 Hz. Nonlinear AR-ANN shown in Fig.…”
Section: Machine Learning Approaches For Speech Inversionmentioning
confidence: 99%
“…As described in other documents (Hogden, 1996), MALCOM is an algorithm that can be used to estimate the probability of a sequence of categorical data. MALCOM can also be applied to speech (and other real valued sequences) if windows of the speech are first categorized using a technique such as vector quantization (Gray, 1984).…”
Section: Cmentioning
confidence: 99%
“…These properties me not unique to HMMs, however. Maximum Likelihood Continuity Mapping (MALCOM), outlined below, is also a stochastic model with learning rules that allow training on large volumes of data (1). The main difference between MALCOM models and HMMs is that the model underlying MALCOM is intended to incorporate important constraints on articulator motions, and therefore better reflect the generative processes underlying speech.…”
Section: Introductionmentioning
confidence: 99%