A pandemic of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection broke out all over the world; however, epidemiological data and viral shedding in pediatric patients are limited. We conducted a retrospective, Chun-Zhen Hua and Zi-Ping Miao contributed equally to this study.
In this paper, a neural network-based lip reading system is proposed. The system is lexiconfree and uses purely visual cues. With only a limited number of visemes as classes to recognise, the system is designed to lip read sentences covering a wide range of vocabulary and to recognise words that may not be included in system training. The system has been testified on the challenging BBC Lip Reading Sentences 2 (LRS2) benchmark dataset. Compared with the state-of-the-art works in lip reading sentences, the system has achieved a significantly improved performance with 15% lower word error rate. In addition, experiments with videos of varying illumination have shown that the proposed model has a good robustness to varying levels of lighting. The main contributions of this paper are: 1) The classification of visemes in continuous speech using a specially designed transformer with a unique topology; 2) The use of visemes as a classification schema for lip reading sentences; and 3) The conversion of visemes to words using perplexity analysis. All the contributions serve to enhance the accuracy of lip reading sentences. The paper also provides an essential survey of the research area. INDEX TERMS deep learning, lip reading, neural networks, perplexity analysis, speech recognition.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Highlights• This paper proposed a frame work based on the inequality constrained optimization model to learn conditional probability table parameters by incorporating expert judgments and Dirichlet priors.• We further improve the proposed method by developing a constrained Bayesian Dirichlet prior.• Combined the proposed method, we provide an improved expectation maximum algorithm for learning conditional probability table parameters from incomplete data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.