Proceedings of the 8th International Conference on Frontiers of Information Technology 2010
DOI: 10.1145/1943628.1943629
|View full text |Cite
|
Sign up to set email alerts
|

Large vocabulary continuous speech recognition for Urdu

Abstract: This paper presents the development of acoustic and language models for robust Urdu speech recognition using the CMU Sphinx Open Source Toolkit for speech recognition. Three models have been developed incrementally, with the addition of speech data of up to two speakers per pass; one model using data from 40 female speakers only, one from 41 male speakers only, and one with both male and female speakers (81 speakers). This paper presents the current recognition results, and discusses approaches for improving t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(5 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…They represent each word as a single phoneme which means that the system performance may suffer degradation for longer words. More recent work for continuous speech Urdu ASR is by Huda et al [12] using Sphinx open source toolkit. We are not making direct comparisons to Huda's work [12] as the data scale as well as application scale differ than our work.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…They represent each word as a single phoneme which means that the system performance may suffer degradation for longer words. More recent work for continuous speech Urdu ASR is by Huda et al [12] using Sphinx open source toolkit. We are not making direct comparisons to Huda's work [12] as the data scale as well as application scale differ than our work.…”
Section: Related Workmentioning
confidence: 99%
“…More recent work for continuous speech Urdu ASR is by Huda et al [12] using Sphinx open source toolkit. We are not making direct comparisons to Huda's work [12] as the data scale as well as application scale differ than our work. Besides, the speech data used in their experimentation is limited to one particular accent only.…”
Section: Related Workmentioning
confidence: 99%
“…However, the results are based on single speaker speech recognition and extensive enhancements are required to transform the system into a multi-speaker system. (Sarfraz et al 2010a ; 2010b ) has also used CMU Sphinx Toolkit towards Large Vocabulary speech recognition of Urdu. The goal was to cover the everyday speech; however, the variety in Urdu accents has not been covered as the target speech is mostly limited to suburban accent spoken in offices and homes.…”
Section: Related Workmentioning
confidence: 99%
“…Urdu continuous speech recognition system [32] was developed using spontaneous speech corpus collected from 82 speakers [25]. Speech corpus was recorded over telephone and microphone channels.…”
Section: Introductionmentioning
confidence: 99%