2015
DOI: 10.1016/j.promfg.2015.07.434
|View full text |Cite
|
Sign up to set email alerts
|

Impact of Accuracy and Latency on Mean Opinion Scores for Speech Recognition Solutions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 3 publications
0
3
0
Order By: Relevance
“…T bat indicates the length of the audio samples to be decoded at a time, and is defined during the initialization stage of the ASR server. In this paper, we set T bat as 200 frames, each of which is 2-s long [14,26,36].…”
Section: Decoder Thread Of the Online Asr Servermentioning
confidence: 99%
See 1 more Smart Citation
“…T bat indicates the length of the audio samples to be decoded at a time, and is defined during the initialization stage of the ASR server. In this paper, we set T bat as 200 frames, each of which is 2-s long [14,26,36].…”
Section: Decoder Thread Of the Online Asr Servermentioning
confidence: 99%
“…During decoding, the minibatch size is set to 2 s. Although a larger minibatch size increases the decoding speed owing to the bulk computation of GPU, the latency also increases. We settle into 2 s of minibatch size as a compromise between decoding speed and latency [14,26,36].…”
Section: Corpus and Baseline Korean Asrmentioning
confidence: 99%
“…5) and they cause little extra overhead. According to a study conducted on 47 participants [43], the acceptable latency is 4 s and the acceptable accuracy is 0.70.…”
Section: Implementation and Overheadmentioning
confidence: 99%