2018
DOI: 10.1007/s11277-018-5930-z
|View full text |Cite
|
Sign up to set email alerts
|

Visual Speech Recognition Using Optical Flow and Hidden Markov Model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 19 publications
0
8
0
Order By: Relevance
“…The model achieved 71% in word recognition accuracy for the CUAVE dataset [23]. Sharma et al [24] have investigated audio-visual speech recognition for the Hindi language by utilizing the Mel Frequency Cepstral Coefficient (MFCC) for audio features. For visual features, they have used the Optical Flow (OF) and conventional lip localization approach, and then it is followed by a HMM for recognition.…”
Section: Handcrafted Features Based Modelsmentioning
confidence: 99%
“…The model achieved 71% in word recognition accuracy for the CUAVE dataset [23]. Sharma et al [24] have investigated audio-visual speech recognition for the Hindi language by utilizing the Mel Frequency Cepstral Coefficient (MFCC) for audio features. For visual features, they have used the Optical Flow (OF) and conventional lip localization approach, and then it is followed by a HMM for recognition.…”
Section: Handcrafted Features Based Modelsmentioning
confidence: 99%
“…Visual feature extraction can be mainly classified as modal-based and imagebased feature extraction. In model-based feature extraction, the Active Appearance Model (AAM) gained more attention [76,80] [75,78,79]. In, audio-visual fusion preferred models are the Hidden Markov Model (HMM) and the Multi-Stream Hidden Markov Model (MSHMM) [75,76,77,78,79].…”
Section: Audio-visual Speech Recognitionmentioning
confidence: 99%
“…In model-based feature extraction, the Active Appearance Model (AAM) gained more attention [76,80] [75,78,79]. In, audio-visual fusion preferred models are the Hidden Markov Model (HMM) and the Multi-Stream Hidden Markov Model (MSHMM) [75,76,77,78,79]. The introduction of deep learning models enhanced the performance of ASR, VSR, and AVSR systems.…”
Section: Audio-visual Speech Recognitionmentioning
confidence: 99%
“…Cooperation and collaboration is another important application of robotics (Naik, 2020;Palaz, Magimai-Doss, Collobert, 2019;Sharma, 2019]. The paper focuses on another dimension of robotics application -speech recognition for care and support.…”
Section: Related Workmentioning
confidence: 99%