2011
DOI: 10.1007/s10772-011-9119-z
|View full text |Cite
|
Sign up to set email alerts
|

Combining formant frequency based on variable order LPC coding with acoustic features for TIMIT phone recognition

Abstract: Combination of multiple acoustic features has great potential to improve Automatic Speech Recognition (ASR) accuracy. Our contribution in this research was to investigate one novel method when using voiced formants' features in combination with standard MFCC features in order to enhance TIMIT phone recognition. These voiced features provide accurate formants frequencies using a Variable Order LPC Coding (VO-LPC) algorithm that was combined with continuity constraints. The overall estimating formants were conca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 22 publications
0
10
0
Order By: Relevance
“…At present, the most popular and successful speech recognition systems use Hidden Markov Models (HMM) [23][24][25][26] in the acoustic modeling. HMM is used to train the acoustic models of sixty one phonemes along with a model of silence (sil).…”
Section: Experimental Setup and Discussionmentioning
confidence: 99%
“…At present, the most popular and successful speech recognition systems use Hidden Markov Models (HMM) [23][24][25][26] in the acoustic modeling. HMM is used to train the acoustic models of sixty one phonemes along with a model of silence (sil).…”
Section: Experimental Setup and Discussionmentioning
confidence: 99%
“…Next, we encode one of the most popular speech dataset TIMIT (Garofolo, 1993) into a spike-version, Spike-TIMIT. TIMIT dataset consists of richer acoustic-phonetic content than TIDIGITS (Messaoud and Hamida, 2011). It consists of continuous speech utterances, that are useful for the evaluation of speech coding schemes (Besacier et al, 2000), speech enhancement El-Solh et al (2007) or ASR systems (Mohamed et al, 2011;Graves et al, 2013).…”
Section: Spike-tidigits and Spike-timit Databasesmentioning
confidence: 99%
“…Furthermore, three noises such as car, jet and speech from the Noisex-92 database has been added to clean data at different signal-to-noise ratios (SNRs) (clean, 20, 15, 10, 5 and 0 dB). In this experiment, the hidden Markov model (HMM) [21][22][23] is used in the back end as phoneme recogniser.…”
Section: Experimental Frameworkmentioning
confidence: 99%