2008
DOI: 10.1109/icassp.2008.4518845
|View full text |Cite
|
Sign up to set email alerts
|

Live search for mobile:Web services by voice on the cellphone

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2009
2009
2014
2014

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 55 publications
(29 citation statements)
references
References 5 publications
0
29
0
Order By: Relevance
“…CD-DNN-HMMs combine the representational power of deep neural networks and the sequential modeling ability of context-dependent hidden Markov models (HMMs). In this paper, we illustrate the key ingredients of the model, describe the procedure to learn the CD-DNN-HMMs' parameters, analyze how various important design choices affect the recognition performance, and demonstrate that CD-DNN-HMMs can significantly outperform strong discriminatively-trained context-dependent Gaussian mixture model hidden Markov model (CD-GMM-HMM) baselines on the challenging business search dataset of [36], collected under actual usage conditions. To our best knowledge, this is the first time DNN-HMMs, which are formerly only used for phone recognition, are successfully applied to large vocabulary speech recognition (LVSR) problems.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…CD-DNN-HMMs combine the representational power of deep neural networks and the sequential modeling ability of context-dependent hidden Markov models (HMMs). In this paper, we illustrate the key ingredients of the model, describe the procedure to learn the CD-DNN-HMMs' parameters, analyze how various important design choices affect the recognition performance, and demonstrate that CD-DNN-HMMs can significantly outperform strong discriminatively-trained context-dependent Gaussian mixture model hidden Markov model (CD-GMM-HMM) baselines on the challenging business search dataset of [36], collected under actual usage conditions. To our best knowledge, this is the first time DNN-HMMs, which are formerly only used for phone recognition, are successfully applied to large vocabulary speech recognition (LVSR) problems.…”
Section: Introductionmentioning
confidence: 99%
“…In section III, we describe the basic ideas, the key properties, and the training and decoding strategies of our CD-DNN-HMMs. In section IV we analyze experimental results on a 65K+ vocabulary business search dataset collected from the Bing mobile voice search application (formerly known as Live Search for mobile [36], [60]) under real usage scenarios. Section V offers conclusions and directions for future work.…”
Section: B Introduction To the Dnn-hmm Approachmentioning
confidence: 99%
“…We evaluated our CD-DBN-HMM system by conducting a series of experiments on the data collected from the Bing mobile voice search (BMVS) application (formerly known as Live Search for mobile [14]) -a real-world, large-vocabulary, spontaneous, continuous speech recognition task.…”
Section: Methodsmentioning
confidence: 99%
“…Our data come from the Windows Live Search for Mobile voice search task [19]. We divide the data into a 2.9M utterance (6.3M word) training set, 8.7K utterance (18.9K word) development set, and 12.7K utterance (27.2K word) test set.…”
Section: Datamentioning
confidence: 99%