2017
DOI: 10.1007/978-981-10-6451-7_14
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Model Validation Techniques for Audio-Visual Speech Recognition

Abstract: This paper implements and compares the performance of a number of techniques proposed for improving the accuracy of Automatic Speech Recognition (ASR) systems. As ASR that uses only speech can be contaminated by environmental noise, in some applications it may be improve performance to employ Audio-Visual Speech Recognition (AVSR), in which recognition uses both audio information and mouth movements obtained from a video recording of the speaker's face region. In this paper, model validation techniques, namely… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 11 publications
0
2
0
Order By: Relevance
“…All the experiments result used the 'leave-one-out' cross validation (LOOCV) method to cross check the result. Based on previous research work [25], LOOCV techniques have been able to achieve a slightly improved accuracy compared to the more common Holdout and bootstrap validation approach. Furthermore, the LOOCV technique is able to allow an evaluation of every sample and obtain the final accuracy value by averaging the results from all samples.…”
Section: A Experiments Setupmentioning
confidence: 99%
“…All the experiments result used the 'leave-one-out' cross validation (LOOCV) method to cross check the result. Based on previous research work [25], LOOCV techniques have been able to achieve a slightly improved accuracy compared to the more common Holdout and bootstrap validation approach. Furthermore, the LOOCV technique is able to allow an evaluation of every sample and obtain the final accuracy value by averaging the results from all samples.…”
Section: A Experiments Setupmentioning
confidence: 99%
“…IOP Publishing doi:10.1088/1757-899X/1230/1/012020 2 HMMs allow to perform preliminary formation of a reference database of words or phrases and then to identify the presented sound images based on comparison with this database [6,7]. One of the types of HMMs is continuous HMMs, where transitions between hidden states and the arrival of observations can occur at arbitrary points of time [8,9].…”
Section: Introductionmentioning
confidence: 99%