2012
DOI: 10.1109/tasl.2012.2191956
|View full text |Cite
|
Sign up to set email alerts
|

Language Identification Using Visual Features

Abstract: Abstract-Automatic visual language identification (VLID) is the technology of using information derived from the visual appearance and movement of the speech articulators to identify the language being spoken, without the use of any audio information. This technique for language identification (LID) is useful in situations in which conventional audio processing is ineffective (very noisy environments), or impossible (no audio signal is available). Research in this field is also beneficial in the related field … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
10
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 30 publications
2
10
0
Order By: Relevance
“…The same assumption holds for the test data as well. This behaviour is in conformity with evidence from research in visual-only language identification, according to which performance increases with the length of speech data [22].…”
Section: Text-independent Experimentssupporting
confidence: 88%
See 2 more Smart Citations
“…The same assumption holds for the test data as well. This behaviour is in conformity with evidence from research in visual-only language identification, according to which performance increases with the length of speech data [22].…”
Section: Text-independent Experimentssupporting
confidence: 88%
“…The authors show through a series of experimental scenarios that observers perform much higher than chance level on visual discrimination between English and Spanish, even when the visual speech segments are presented to them in reverse temporal order. Recently, Newman and Cox [22] present an automated approach for both speaker-dependent and speaker-independent visual-only language identification, based on features that capture phonology and phonotactics characteristics of visual speech. They use Active Shape Models and Active Appearance Models (ASM, AAM) [32] for visual feature extraction, which is carried out frame-wise, and then feature vectors are "tokenised" in visually-transcribed phonemes.…”
Section: B Visual Speech and Accentmentioning
confidence: 99%
See 1 more Smart Citation
“…A tutorial on LID has been presented in [2] in which syntactic, morphological, and acoustic, phonetic, phonotactic, and prosodic level information have been discussed in details. Around 87 prosodic features has been used for LID system in [3] which provides better recognition performance, while [4] utilizes visual features with error rate less than 10%. In [5], a highly accurate and computationally efficient framework of i-vector presentation is proposed for rapid language identification.…”
Section: Introductionmentioning
confidence: 99%
“…However, the beneficial role of visual information to speech comprehension has been well documented and experimentally validated [12]. Specifically, automated visual-only approaches have been developed for language identification [8]. Another study shows that visual identification of accent is feasible for human observers [5].…”
Section: Introductionmentioning
confidence: 99%