2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854205
|View full text |Cite
|
Sign up to set email alerts
|

Accurate client-server based speech recognition keeping personal data on the client

Abstract: In this paper, a novel technique is proposed that recognizes speech on a server but all private knowledge is processed on the client. Private knowledge could be address book entries, calendar entries or medical patient data.The technique combines the advantage of a powerful server with almost unlimited memory and the advantage using locally available user dependent knowledge. A dynamic language model is used to recognize speech with the help of content dependent acoustic fillers on a server. The result is then… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…This hybrid ASR system computes its final recognition result using server-side ASR system combination, in which the recognition result from a highly personalized on-device ASR system is combined with the results from a more generic server based ASR system. Similar hybrid approaches to ASR personalization have been explored before [22,23,24], albeit not to our knowledge in the context of at scale FT of the personalized on-device ASR system.…”
Section: Asr Personalizationmentioning
confidence: 99%
“…This hybrid ASR system computes its final recognition result using server-side ASR system combination, in which the recognition result from a highly personalized on-device ASR system is combined with the results from a more generic server based ASR system. Similar hybrid approaches to ASR personalization have been explored before [22,23,24], albeit not to our knowledge in the context of at scale FT of the personalized on-device ASR system.…”
Section: Asr Personalizationmentioning
confidence: 99%
“…However, data privacy regulations as well as increasing demand for cloud offloading motivates execution on low power platforms without any need for cloud connection as proposed by, e.g., Georg Stemmer et al [31]. It is also in discussion to distribute the computation between embedded devices such as mobile phones and the cloud as proposed, e.g., by Munir Georges et al [32] or Alice Coucke et al [33]. Recent development goes in the direction of end-to-end and all neuronal network systems.…”
Section: Intent Classificationmentioning
confidence: 99%
“…Examples are: improved protection of user privacy, lower recognition latency, or the autonomy from communications channels. Local inference has been previously addressed for speech recognition [8] and spoken language understanding [9]. The main challenges in local inference is the limited compute and memory available on the device.…”
Section: Introductionmentioning
confidence: 99%