Encyclopedia of Language &Amp; Linguistics 2006
DOI: 10.1016/b0-08-044854-2/00907-x
|View full text |Cite
|
Sign up to set email alerts
|

Speech Recognition: Statistical Methods

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(17 citation statements)
references
References 44 publications
0
17
0
Order By: Relevance
“…In this work, the phonetic transcription labels produced by the SRIs Decipher state-of-the-art ASR system [34] 36.1% respectively. While these results are equivalent to those obtained by other state-ofthe-art systems on similar databases [35], transcription errors will be non negligible and will produce that, in order to compute the i-vector for a particular linguistic unit, some frames belonging to a different one will be taken into account, degrading the performance of the system based on that unit. In this work, no exhaustive analysis has been done regarding whether the errors occurred are associated with particular units or speakers, as we have no transcriptions available for the datasets used.…”
Section: Region Conditioningmentioning
confidence: 54%
“…In this work, the phonetic transcription labels produced by the SRIs Decipher state-of-the-art ASR system [34] 36.1% respectively. While these results are equivalent to those obtained by other state-ofthe-art systems on similar databases [35], transcription errors will be non negligible and will produce that, in order to compute the i-vector for a particular linguistic unit, some frames belonging to a different one will be taken into account, degrading the performance of the system based on that unit. In this work, no exhaustive analysis has been done regarding whether the errors occurred are associated with particular units or speakers, as we have no transcriptions available for the datasets used.…”
Section: Region Conditioningmentioning
confidence: 54%
“…The ASR problem is formulated within the Bayesian framework. In this method, an utterance is represented by some sequence of acoustic feature vector X, derived from the underlying sequence of words W, and the recognition system needs to find the most likely word sequence as given below [37]…”
Section: Gmm/dnnmentioning
confidence: 99%
“…Actually, there are two possible types of speech feature modelling, deterministic and statistical models (Rabiner and Juang 2006). The deterministic models exploit the intrinsic properties of the speech signal, while the statistical models are concerned with statistical properties of the speech signal (Chien and Chueh 2009).…”
Section: Introductionmentioning
confidence: 99%