This paper presents an automated lip-reading system consisting of two main modules: a pre-processing module able to extract lip-geometric information from video sequences, and a classification module to recognize visual speech based on dynamic lip movements. The recognition performance of the designed system has been assessed on the recognition of the English digits 0 to 9 spoken by the speakers in the video sequences available in the Clemson University Audio Visual Experiments (CUAVE) database. The extraction of lip-geometric features was carried out using a combination of a skin-color filter, a border-following algorithm and a convex-hull approach; the proposed method was compared with the well-known 'snake' technique and was found to improve the lip-shape extraction performance for the database considered. Lip-geometric features including height, width, ratio, area and perimeter, as well as various combinations of these features, were evaluated to determine which best represent speech in the visual domain, using different classification methods, namely optical flow, Dynamic Time Warping (DTW), a new approach termed Multidimensional DTW, and Hidden Markov Models (HMM). The experiments show that the proposed system is capable of a recognition performance of 74% using only lip height, and that, with lip width and the ratio of these features included, it is comparable to conventional appearance-based Discrete Cosine Transform (DCT) techniques, demonstrating that the system has the potential to be incorporated into a multimodal speech recognition system for use in dynamic environments.
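
To make the described pipeline concrete, the following minimal sketch illustrates the general idea of the two modules under stated assumptions: an HSV color threshold standing in for the skin-color filter, OpenCV contour extraction standing in for the border-following step, a convex hull to regularise the lip boundary, and a plain DTW distance for comparing per-frame lip-height sequences. The use of OpenCV, the threshold values and the helper names (lip_geometry, dtw_distance) are illustrative assumptions, not details taken from the paper.

    # Hypothetical sketch, not the authors' implementation.
    import cv2
    import numpy as np

    def lip_geometry(frame_bgr,
                     lower_hsv=(0, 60, 60), upper_hsv=(20, 255, 255)):
        """Return (height, width, ratio, area, perimeter) of the largest
        lip-like region. The HSV bounds are placeholder values."""
        hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
        # Color filter: keep pixels inside the assumed lip/skin color range.
        mask = cv2.inRange(hsv,
                           np.array(lower_hsv, dtype=np.uint8),
                           np.array(upper_hsv, dtype=np.uint8))
        # Contour extraction (OpenCV 4.x return signature) in place of the
        # border-following step described in the paper.
        contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                       cv2.CHAIN_APPROX_SIMPLE)
        if not contours:
            return None
        lip = max(contours, key=cv2.contourArea)   # largest candidate blob
        hull = cv2.convexHull(lip)                 # smooth the lip boundary
        x, y, w, h = cv2.boundingRect(hull)
        area = cv2.contourArea(hull)
        perimeter = cv2.arcLength(hull, True)
        ratio = h / w if w else 0.0
        return h, w, ratio, area, perimeter

    def dtw_distance(a, b):
        """Classic dynamic time warping distance between two 1-D feature
        sequences, e.g. lip height per frame for two spoken digits."""
        a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
        n, m = len(a), len(b)
        cost = np.full((n + 1, m + 1), np.inf)
        cost[0, 0] = 0.0
        for i in range(1, n + 1):
            for j in range(1, m + 1):
                d = abs(a[i - 1] - b[j - 1])
                cost[i, j] = d + min(cost[i - 1, j],
                                     cost[i, j - 1],
                                     cost[i - 1, j - 1])
        return cost[n, m]

In such a setup, an unknown digit utterance would be assigned the label of the stored template whose lip-height sequence gives the smallest DTW distance; the Multidimensional DTW and HMM classifiers mentioned above would replace dtw_distance with multi-feature variants.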