2022 International Conference on Machine Vision and Image Processing (MVIP) 2022
DOI: 10.1109/mvip53647.2022.9738749
|View full text |Cite
|
Sign up to set email alerts
|

Lip reading using external viseme decoding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Chung et al [15] extended the WAS model to the MV-WAS model that can decode visual sequences across all poses and show that it is possible to read lips in profile, but the standard is inferior to reading frontal faces. Peymanfard et al [29] suggested external viseme decoding, which divides the sequence-to-sequence model into two stages, video to viseme and viseme to character, respectively. The network outputs character sequence given viseme through external text data.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Chung et al [15] extended the WAS model to the MV-WAS model that can decode visual sequences across all poses and show that it is possible to read lips in profile, but the standard is inferior to reading frontal faces. Peymanfard et al [29] suggested external viseme decoding, which divides the sequence-to-sequence model into two stages, video to viseme and viseme to character, respectively. The network outputs character sequence given viseme through external text data.…”
Section: Related Workmentioning
confidence: 99%
“…In this step, we expected the phonemes having the same visual features, like the same lip movements and forms, to be grouped in the same category. We used the method in [29] to train our model. Employing the two models of the video-to-viseme and the viseme-to-character models and finally merging the two, we obtained the lip reading model.…”
Section: Viseme Analysismentioning
confidence: 99%