Lip-Reading Technique Using Spatio-Temporal Templates and Support Vector Machines

Yau, Wai Chee; Kumar, Deepak; Chinnadurai, Tharangini

doi:10.1007/978-3-540-85920-8_74

Cited by 6 publications

(2 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Since the ASR system needs to treat various lengths of temporal features, dynamic time warping methods [13] or hidden Markov models (HMM) [14,15] have been widely used to handle the temporal data. Other researchers have used artificial neural networks (ANN) [16,17] and support vector machine (SVM) [18] that are renowned for its excellent generalization performance. However, these methods are not ideal for lip reading applications because they require a huge amount of training data, and would require extensive retraining any time a new word class is added to the database.…”

Section: Introductionmentioning

confidence: 99%

Real-time lip reading system for isolated Korean word recognition

Shin

Liu

Kim

2011

Pattern Recognition

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 99%

Real-time lip reading system for isolated Korean word recognition

Shin

Liu

Kim

2011

Pattern Recognition

View full text Add to dashboard Cite

“…Various variants of HMMs have also been used for audio-visual ASR, such as HMMs with non-Gaussian continuous observation probabilities [39]. Moreover, additional methods to overcome the difference in the speed of speaking for classification have been employed in audio-visual ASR systems, such as dynamic time warping (DTW), used by Petajan [4] are computationally expensive and inaccurate, while other classifiers that allow the difference among speakers to be considered for classifying the visual data have used artificial neural networks (ANN) [40,41], hybrid ANN-DTW systems [42], hybrid ANN-HMM [43] and recently the support vector machines (SVM) [44]. SVM is based on the structural risk minimization principle in contrast to empirical risk minimization on which many classifiers are based.…”

Section: Tvc751_sourcementioning

confidence: 99%

Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments

2012

View full text Add to dashboard Cite

Appearance-based visual speech recognition using only video signals is presented. The proposed technique is based on the use of directional motion history images (DMHIs), which is an extension of the popular optical flow method for object tracking. Zernike moments of each DMHI are computed in order to perform the classification. The technique incorporates automatic temporal segmentation of isolated utterances. The segmentation of isolated utterance is achieved using pair-wise pixel comparison. Support vector machine is used for classification and the results are based on leave-one-out paradigm.Experimental results show that the proposed technique achieves better performance in visemes recognition than others reported in literature. The benefit of this proposed visual speech recognition method is that it is suitable for real-time applications due to quick motion tracking system and the fast classification method employed. It has applications in command and control using lip movement to text conversion and can be used in noisy environment and also for assisting speech impaired persons.

show abstract

Lips Recognition for Biometrics

Choraś

2009

Advances in Biometrics

View full text Add to dashboard Cite

According to information researchers there will be 200 Trillion Petabites of information up to 2020, which is exceptionally enormous. So security of this information is additionally compulsory. There are a few approaches to keep the information secure however the security in view of biometric is exceptionally famous and secure. As we as a whole realize that the nature gives us a few unmistakable elements. By utilizing these components researchers found security framework which is known as biometric security framework. Biometric security framework is secure and safe security nowadays. There are huge strategies to actualize security. In this paper we will discuss execution of unconstrained biometric security of pericular acknowledgment. Unconstrained means the security framework will work in non helpful condition. There are gigantic unconstrained condition and we will discover the resultant of pericular acknowledgment in posture shrewd way i.e. to acknowledgment periculars i.e. Eye Corners, when a man in at around 30 level of stance point from Left Side and Right Side.

show abstract

Lip-Reading Technique Using Spatio-Temporal Templates and Support Vector Machines

Cited by 6 publications

References 14 publications

Real-time lip reading system for isolated Korean word recognition

Real-time lip reading system for isolated Korean word recognition

Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments

Lips Recognition for Biometrics

Contact Info

Product

Resources

About