2019
DOI: 10.30534/ijatcse/2019/31852019
|View full text |Cite
|
Sign up to set email alerts
|

The Performance and Classifications of Audio-Visual Speech Recognition by Using the Dynamic Visual Features Extractions

Abstract: Performance and classifications of the human speech recognition is bi-modular in nature and the expansion of visual data from the speaker's mouth area has been appeared to expand the presentation of the automatic speech recognition ASR frameworks. The actual performance and classifications of the audio visual speech recognitions break down quickly within the sight of even moderate commotion, however can be high quality by including visual data from the speaker mouth region. Therefore, the new methodology taken… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…A couple of strategies have-been planned in the previous for the lip-understanding techniques. In [3] sis sorts of the visual-component were taken a gander at the discrete cosine transforms DCT has been investigated to the preeminent execution and it's connection with discrete wavelet transforms DWT with the strategy of the principal component analysis PCA as well as active appearance-model AAM moves close to the lip geometrical techniques. In [4] lip-perusing approaches such as motion history imaging MHI have been depicted in the progression improvements were gotten and total subtraction systems gave an amazing depiction of the visual talk remembers for a single grayscale picture and the artificial neural network ANN, were used to amass MHI pictures.…”
Section: Lip-segmentationmentioning
confidence: 99%
“…A couple of strategies have-been planned in the previous for the lip-understanding techniques. In [3] sis sorts of the visual-component were taken a gander at the discrete cosine transforms DCT has been investigated to the preeminent execution and it's connection with discrete wavelet transforms DWT with the strategy of the principal component analysis PCA as well as active appearance-model AAM moves close to the lip geometrical techniques. In [4] lip-perusing approaches such as motion history imaging MHI have been depicted in the progression improvements were gotten and total subtraction systems gave an amazing depiction of the visual talk remembers for a single grayscale picture and the artificial neural network ANN, were used to amass MHI pictures.…”
Section: Lip-segmentationmentioning
confidence: 99%
“…Features may contain unreliable data, which may lead the classification process to produce undesirable results; thus, a feature selection approach is considered a solution for this kind of problem [13]. Also, the selection of appropriate highlights assumes a fundamental job in the selection process [14]. Feature selection (FS) is an essential machine learning technique for classification applications to achieve an optimal subset of input features [15].…”
Section: Introductionmentioning
confidence: 99%