2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015
DOI: 10.1109/asru.2015.7404778
|View full text |Cite
|
Sign up to set email alerts
|

Personalizing universal recurrent neural network language model with user characteristic features by social network crowdsourcing

Abstract: With the popularity of mobile devices, personalized speech recognizer becomes more realizable today and highly attractive. Each mobile device is primarily used by a single user, so it's possible to have a personalized recognizer well matching to the characteristics of individual user. Although acoustic model personalization has been investigated for decades, much less work have been reported on personalizing language model, probably because of the difficulties in collecting enough personalized corpora. Previou… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…It combines with convolutional neural network in deep learning to filter redundant photo information. In order to capture the characteristics of users, Tseng et al [51] proposed a speech recognition model named RNNLM, which uses RNN to capture the features of different users. Toman et al [52] used the basic deep learning model DNN for crowdsourcing voice data collection, through data selection and enhancement of a large number of voices.…”
Section: Mobile Crowdsourcing Techniques With Deep Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…It combines with convolutional neural network in deep learning to filter redundant photo information. In order to capture the characteristics of users, Tseng et al [51] proposed a speech recognition model named RNNLM, which uses RNN to capture the features of different users. Toman et al [52] used the basic deep learning model DNN for crowdsourcing voice data collection, through data selection and enhancement of a large number of voices.…”
Section: Mobile Crowdsourcing Techniques With Deep Learningmentioning
confidence: 99%
“…Hamrouni et al [50] Presenting photo-based MCS framework. Tseng et al [51] Presenting a speech recognition model RNNLM. Toman et al [52] Using the basic deep learning model DNN for crowdsourcing voice data collection.…”
Section: Referencesmentioning
confidence: 99%
“…Language modeling is a critical component for many NLIs, and personalization is a natural direction to improve these interfaces. Several published works have explored personalization of language models using historical search queries (Jaech and Ostendorf, 2018), features garnered from social graphs (Wen et al, 2012;Tseng et al, 2015;Lee et al, 2016), and transfer learning techniques (Yoon et al, 2017). Other work has explored using profile information (location, name, etc.)…”
Section: Related Workmentioning
confidence: 99%
“…The Smart Compose model is extended for personalization, as detailed in Section 5. As an important feature, personalization has been considered as a specific language model adaptation techniques in various applications, see for example [21,22,34,47,50]. In terms of the personalized model architecture, perhaps most close to our approach is Chen et al [11], which is also a linear interpolation between an n-gram and recurrent neural network language model.…”
Section: Related Workmentioning
confidence: 99%