2014
DOI: 10.1007/978-3-319-11581-8_6
|View full text |Cite
|
Sign up to set email alerts
|

A Framework for Recording Audio-Visual Speech Corpora with a Microphone and a High-Speed Camera

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
2

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 10 publications
0
1
0
2
Order By: Relevance
“…However, robust and reliable automatic Russian speech recognition systems, practically, do not exist. The development of Russian speech technologies is heavily influenced by the nature of the language, such as absence of strict grammatical constructions in sentences, huge amount of word formation rules, large number of exceptions, and the variability of Russian speech in the presence of dialects and accents [26].…”
Section: Datamentioning
confidence: 99%
“…However, robust and reliable automatic Russian speech recognition systems, practically, do not exist. The development of Russian speech technologies is heavily influenced by the nature of the language, such as absence of strict grammatical constructions in sentences, huge amount of word formation rules, large number of exceptions, and the variability of Russian speech in the presence of dialects and accents [26].…”
Section: Datamentioning
confidence: 99%
“…В работах [45][46][47][48] более подробно описаны подходы к извлечению визуальных признаков, исполь-зуемых в задачах определения контура губ говорящего, структурно-виземного анализа русской речи и др. В публикациях [49,50] также рассматриваются методы извлечения визуальных признаков в контексте задачи распознавания речи по губам.…”
Section: рис 1 общая структура аудиовизуальной системы распознаваниunclassified
“…Количество используемых виземных классов зависит от языка, и для русского обычно использова-лось от 10 до 14 классов [7][8][9]. В наших экспериментах мы использовали от 2 (разделение на гласные и согласные) до 48 виземных классов (по количеству фонем), с шагом 2.…”
unclassified