2014
DOI: 10.1007/s10772-014-9257-1
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of isolated words using Zernike and MFCC features for audio visual speech recognition

Abstract: Automatic Speech Recognition (ASR) by machine is an attractive research topic in signal processing domain and has attracted many researchers to contribute in this area. In recent year, there have been many advances in automatic speech reading system with the inclusion of audio and visual speech features to recognize words under noisy conditions. The objective of audio-visual speech recognition system is to improve recognition accuracy. In this paper we computed visual features using Zernike moments and audio f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 48 publications
(19 citation statements)
references
References 13 publications
0
19
0
Order By: Relevance
“…Given Vn,m(x,y), we can calculate the ZM Zn,m(I) of the n ‐th order and with the repetition number of m through Equation () (Borde et al, ) for an arbitrary image I ( x , y ) with the size of p × q pixels. In the calculation, I ( x , y ) can be alternatively viewed as a function defined over the integer space of x[0,p] and y[0,q].…”
Section: The New Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Given Vn,m(x,y), we can calculate the ZM Zn,m(I) of the n ‐th order and with the repetition number of m through Equation () (Borde et al, ) for an arbitrary image I ( x , y ) with the size of p × q pixels. In the calculation, I ( x , y ) can be alternatively viewed as a function defined over the integer space of x[0,p] and y[0,q].…”
Section: The New Methodsmentioning
confidence: 99%
“…These drawbacks can be resolved by region-based shape feature extraction methods, which represent an image as one or a collection of interior shape regions. Mainstream methods for deriving region-based shape descriptors include image moment-based descriptors (Borde, Varpe, Manza, & Yannawar, 2014;Chen & Sun, 2010;Papakostas, Boutalis, Karras, & Mertzios, 2007), gridbased descriptors (Lu & Sajjanhar, 1999), and distributionbased shape descriptors. Distribution-based shape descriptors are derived according to histograms of points or subregions of interest inside an image, for example, scaleinvariant feature descriptors (Lowe, 2004), oriented gradient-based descriptors (Dalal & Triggs, 2005;Mikolajczyk & Schmid, 2005), and sped-up robust features (Bay, Ess, Tuytelaars, & Van, 2008).…”
Section: Related Workmentioning
confidence: 99%
“…Some existingworks, for instant, Agus Harjokoexamined voices by using MFCC (Abriyono and Harjoko, 2012) to extract voice features. The same research have also been performed by Prasad (Borde, 2015) andChan (Chan et al, 2016) by using neural network.…”
Section: Introductionmentioning
confidence: 94%
“…iris and eye movement) [72]. [73], [74], [75], [76], [77], [78], [79], [80] Behavioral (Gait, Signature, Handwriting) Sparse reconstruction based metric learning, Gradient local binary patterns, longest run feature [81], [82] Bioacoustics Pitch, Zernike moment [83], [84] Bio-Signal (EEG, ERG, ECG, EMG, EOG, GSR, MEG, MCG, MMG)…”
Section: )mentioning
confidence: 99%