Proceedings of the 11th Joint Conference on Information Sciences (JCIS) 2008
DOI: 10.2991/jcis.2008.61
|View full text |Cite
|
Sign up to set email alerts
|

HIT-AVDB-II: A New Multi-view and Extreme Feature Cases Contained Audio-Visual Database for Biometrics

Abstract: For research on the proper law of audiovisual speech and biometrics technology, and evaluation of algorithms and systems, we construct a multi-language and multiview database HIT-AVDB-II with a corpus of various common and special sentences include Chinese and English poems, tongue twister, digits, Greek alphabet and music. The HIT-AVDB-II is ready to facilitate the investigation of multi-view biometrics technology and visual speech reading. HIT-AVDB-II contains formal and extreme feature cases for study. For … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 13 publications
0
4
0
Order By: Relevance
“…Lip-reading datasets with people pronouncing sentences in other languages have also been created too. Examples include AV@CAR [15] and VLRF for Spanish, AVAS [16] and AVSD [20] for Arabic, BL [23] and IV2 [37] for French, UWB-05-HSAVC [57], and UWB-07-ICAV [58] for Czech, the German NDUTAVSC [49] dataset, the Russian HAVRUS [32] corpus and the HIT-AVDB-II [33] database that covers Chinese and English.…”
Section: B Word and Sentence Recognitionmentioning
confidence: 99%
“…Lip-reading datasets with people pronouncing sentences in other languages have also been created too. Examples include AV@CAR [15] and VLRF for Spanish, AVAS [16] and AVSD [20] for Arabic, BL [23] and IV2 [37] for French, UWB-05-HSAVC [57], and UWB-07-ICAV [58] for Czech, the German NDUTAVSC [49] dataset, the Russian HAVRUS [32] corpus and the HIT-AVDB-II [33] database that covers Chinese and English.…”
Section: B Word and Sentence Recognitionmentioning
confidence: 99%
“…During the acquisition process, each speaker read each sentence three times at an even speed. Then HIT-AVDB-II [191] database was collected from Chinese poems, which contained 30 people, each reading 11 Chinese poems. IV2 [192] database was a sentence level database based on French, with 300 people participating in the recording, each speaking 15 French sentences.…”
Section: ) Word Phrase and Sentence Recognitionmentioning
confidence: 99%
“…AV Digital [119] database placed the camera at three angles of 0 °, 45 °, and 90 °. The HIT-AVDB-II [191] and LTS5 [170] collected view data at 0 °, 30 °, 60 °, and 90 °. LILiR [179] and OuluVS2 [173] collected view data at 0 °, 30 °, 45 °, 60 ° and 90 °.…”
Section: ) Multi View Databasesmentioning
confidence: 99%
“…A list of commonly-used English language AVSR databases is given in Table I [7], [11], AND [12]) ( Speech Recognition) medium to large-vocabulary continuous speech recognition: AV-TIMIT, GRID, VidTIMIT, IBM LVCSR and AusTalk. Of these, only GRID and VidTIMIT are currently available: AV-TIMIT and IBM LVCSR have not been released, while AusTalk is not yet available though a release is planned.…”
Section: Introductionmentioning
confidence: 99%