Interspeech 2005 2005
DOI: 10.21437/interspeech.2005-116
|View full text |Cite
|
Sign up to set email alerts
|

Rapid porting of ASR-systems to mobile devices

Abstract: Portable devices for the consumer market are becoming available in large quantities. Because of their design and use, human speech often is the input modality of choice, for example for car navigation systems or portable speech-to-speech translation devices. In this paper we describe our work in porting our existing desktop PC based speech recognition system to an off-the-shelf PDA running WindowsCE3.0. We do this in a way that our already well performing language and acoustic models can be taken over without … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2006
2006
2012
2012

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…As the main input modality, we implemented speaker independent speech recognition using the integrated microphone of the mobile device. The ASR system uses the Janus Recognition Toolkit (JRTk) featuring the IBIS decoder [3] for the laptop version, and a PDA-optimized port [4] for the PDA version, that runs around 2-5x real-time on the 624MHz Intel XScale PXA270 processor. This is the standard processor found in most commercial PDAs.…”
Section: Automatic Speech Recognition (Asr)mentioning
confidence: 99%
See 1 more Smart Citation
“…As the main input modality, we implemented speaker independent speech recognition using the integrated microphone of the mobile device. The ASR system uses the Janus Recognition Toolkit (JRTk) featuring the IBIS decoder [3] for the laptop version, and a PDA-optimized port [4] for the PDA version, that runs around 2-5x real-time on the 624MHz Intel XScale PXA270 processor. This is the standard processor found in most commercial PDAs.…”
Section: Automatic Speech Recognition (Asr)mentioning
confidence: 99%
“…The Early Feature Vector Reduction [4] (EFVR) is used to remove redundant consecutive feature vectors, as found in silence and static vowels or noises, which results in a reduction of 25 to 50% of the feature vectors before they are fed into the decoder.…”
Section: Pda Specific Speedup Techniquesmentioning
confidence: 99%