1996
DOI: 10.1007/978-3-662-13015-5_28
|View full text |Cite
|
Sign up to set email alerts
|

Active Shape Models for Visual Speech Feature Extraction

Abstract: Most approaches for lip modelling are based on heuristic constraints imposed by the user. We describe the use of Active Shape Models for extracting visual speech features for use by automatic speechreading systems, where the deformation of the lip model as well as image search is based on a priori knowledge learned from a training set. We demonstrate the robustness and accuracy of the technique for locating and tracking lips on a database consisting of a broad variety of talkers and lighting conditions.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
35
0

Year Published

1996
1996
2011
2011

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 62 publications
(35 citation statements)
references
References 4 publications
0
35
0
Order By: Relevance
“…They are obtained from the tracking results and serve as features for the recognition system. We have described the detailed feature extraction method elsewhere [16 ] [17 ].…”
Section: Feature Extractionmentioning
confidence: 99%
“…They are obtained from the tracking results and serve as features for the recognition system. We have described the detailed feature extraction method elsewhere [16 ] [17 ].…”
Section: Feature Extractionmentioning
confidence: 99%
“…Despite years of research attention, there has been limited success in creating a system that can reliably detect lips in unconstrained imagery. Existing systems employ methods such as snake and active shape models [5,6], Markov Random Field (MRF) techniques [7], and multi-class, shape-guided fuzzy c-means (FCM) clustering algorithm [8], to detect and locate lips within an image. While the results are commendable, the extensive calculations demanded by these methods are significant.…”
Section: Introductionmentioning
confidence: 99%
“…One of the early visual speech recognition systems was developed by Petajan [19] in which the shape of the lips was sampled by simple morphological features including height, width and area of the lips contour. Later in 1995, Luettin et al [6] applied Active Shape Models (ASM) in order to extract the lips outline and this information was used in the recognition of a set of standard English phonemes. A different approach was proposed by Harvey et al [17] where they applied a morphological transform called sieve that was applied to calculate simple one-dimensional (1D) and two-dimensional (2D) measurements that were employed to sample the lips shapes.…”
Section: Introductionmentioning
confidence: 99%