2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 2007
DOI: 10.1109/icassp.2007.366995
|View full text |Cite
|
Sign up to set email alerts
|

An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling

Abstract: The so-called tandem approach, where the posteriors of a multilayer perceptron (MLP) classi er are used as features in an automatic speech recognition (ASR) system has proven to be a very effective method. Most tandem approaches up to date have relied on MLPs trained for phone classi cation, and appended the posterior features to some standard feature hidden Markov model (HMM). In this paper, we develop an alternative tandem approach based on MLPs trained for articulatory feature (AF) classi cation. We also de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2007
2007
2021
2021

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(44 citation statements)
references
References 9 publications
0
44
0
Order By: Relevance
“…These are not exclusive; many systems often concatenate both acoustic features and posteriors derived from them as inputs to a system (e.g., [9]). • Statistical model: HMMs correspond to a family of generative models; that is, the statistical models are formulated so that they model data generation, requiring a likelihood of the data as well as a model prior to compute model posteriors.…”
Section: Prior Workmentioning
confidence: 99%
See 1 more Smart Citation
“…These are not exclusive; many systems often concatenate both acoustic features and posteriors derived from them as inputs to a system (e.g., [9]). • Statistical model: HMMs correspond to a family of generative models; that is, the statistical models are formulated so that they model data generation, requiring a likelihood of the data as well as a model prior to compute model posteriors.…”
Section: Prior Workmentioning
confidence: 99%
“…In addition to the Hybrid system, [4] also describes a system that uses outputs of a set of phonological attribute classifiers in an HMM system; this style of system has been further explored in [8] and in [9]. 3 As the Tandem HMM system is built using mixtures of Gaussians to describe state emission probabilities, either the correlated phonological feature inputs must first be decorrelated before being fed into the system or the system must make use of full or semi-tied covariance matrices and suffer an explosion in parameters and required training data.…”
Section: Prior Workmentioning
confidence: 99%
“…The 500 word baseline system has a small improvement over the results presented in [13]. Note that systems that train with additional data outside the designated SVitchboard training sets have reported lower word error rates [13,20].…”
Section: Resultsmentioning
confidence: 82%
“…Most tandem features to date have relied on MLPs trained for phone classification. Cetin et al [105] illustrated on a relatively small data set that MLPs trained for articulatory feature classification can be equally effective. They provided a similar comparison using MLPs trained on a much larger data set -2,000 h of English conversational telephone speech.…”
Section: Speech and Audio Processingmentioning
confidence: 99%