2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
DOI: 10.1109/icassp.2001.940753

Speaker- and language-independent speech recognition in mobile communication systems

Cited by 13 publications (3 citation statements)
References 6 publications
“…However, it has been proven that taking the maximum of all of the mixtures, instead of the sum, is a very good approximation of the result [17]. Hence, by taking the negative of the logarithm of (1) in order to convert probabilities into costs and applying the previous approximation, the acoustic cost can be evaluated by where α is a coefficient per mixture that encompasses all constants and parameters outside of the exponential function in (1). In order to further simplify the computation of the Gaussian block, the variance σ² is replaced by a new variable v as described in (3).…”
Section: Gaussian Calculation (mentioning)
confidence: 99%
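
The approximation described in this statement can be illustrated with a short Python sketch. It is not the citing paper's implementation: equations (1) and (3) are not reproduced in the excerpt, so the sketch assumes a diagonal-covariance Gaussian mixture and assumes v = 1/(2σ²). The function names are hypothetical. It compares the exact acoustic cost (negative log of the mixture sum) with the cost obtained by keeping only the best mixture.

```python
import math

def mixture_params(weights, means, variances):
    """Precompute alpha per mixture and v per dimension.

    Assumption: v = 1 / (2 * sigma^2); the cited equation (3) is not shown
    in the excerpt, so this definition is only illustrative.
    """
    alphas, vs = [], []
    for w, var in zip(weights, variances):
        # alpha collects everything outside the exponential:
        # the mixture weight and the Gaussian normalisation constant.
        alpha = -math.log(w) + 0.5 * sum(math.log(2.0 * math.pi * s2) for s2 in var)
        alphas.append(alpha)
        vs.append([1.0 / (2.0 * s2) for s2 in var])
    return alphas, vs

def per_mixture_costs(x, means, alphas, vs):
    """Cost (negative log-likelihood) of x under each weighted mixture."""
    return [
        a + sum((xd - md) ** 2 * vd for xd, md, vd in zip(x, mu, v))
        for a, mu, v in zip(alphas, means, vs)
    ]

def exact_cost(costs):
    """-log(sum_m exp(-cost_m)), computed in a numerically stable way."""
    c_min = min(costs)
    return c_min - math.log(sum(math.exp(c_min - c) for c in costs))

def max_approx_cost(costs):
    """Keep only the most likely mixture, i.e. the smallest cost."""
    return min(costs)

if __name__ == "__main__":
    weights = [0.6, 0.4]
    means = [[0.0, 0.0], [3.0, 3.0]]
    variances = [[1.0, 1.0], [1.0, 1.0]]
    alphas, vs = mixture_params(weights, means, variances)
    costs = per_mixture_costs([0.1, -0.2], means, alphas, vs)
    print("exact cost:      ", exact_cost(costs))
    print("max-approx cost: ", max_approx_cost(costs))
```

With M mixtures the approximate cost is never below the exact one and exceeds it by at most log(M), which is one way to see why dropping the sum costs little when a single mixture dominates.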
“…Large vocabulary speaker independent systems have potential in all forms of computing, from hand held mobile devices to personal computing and even large scale data centres. A low power, real-time embedded system could dramatically impact our daily interactions with digital mobile technology [1] while a faster than real-time multi-stream batch decoder could be used in server applications for distributed systems [2] or data-mining [3,4].…”
Section: Introduction (mentioning)
confidence: 99%
“…Due to globalization as well as the international nature of the markets and the future applications, speaker independence implies the development and use of language independent automatic speaker recognition to avoid logistic difficulties. Hence, they proposed architecture for embedded multilingual speech recognition systems [17]. Rama Murty and Yegnanarayana [8] combined the evidences from the residual phase and MFCC methods used for speaker recognition and obtained very good results.…”
Section: Introduction (mentioning)
confidence: 99%