2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7472860
|View full text |Cite
|
Sign up to set email alerts
|

Adversarial Bandit for online interactive active learning of zero-shot spoken language understanding

Abstract: Many state-of-the-art solutions for the understanding of speech data have in common to be probabilistic and to rely on machine learning algorithms to train their models from large amount of data. The difficulty remains in the cost of collecting and annotating such data. Another point is the time for updating an existing model to a new domain. Recent works showed that a zero-shot learning method allows to bootstrap a model with good initial performance. To do so, this method relies on exploiting both a small-si… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 11 publications
(17 citation statements)
references
References 18 publications
0
16
0
1
Order By: Relevance
“…An on-line adaptation strategy is adopted, as presented in [11] and briefly recalled here. In this approach, at each dialogue iteration, the system chooses an adaptation action i t ∈ I and uses the user feedback to update K.…”
Section: On-line Learning For Zero-shot Spmentioning
confidence: 99%
See 2 more Smart Citations
“…An on-line adaptation strategy is adopted, as presented in [11] and briefly recalled here. In this approach, at each dialogue iteration, the system chooses an adaptation action i t ∈ I and uses the user feedback to update K.…”
Section: On-line Learning For Zero-shot Spmentioning
confidence: 99%
“…In this system could have been implemented the most sample-efficient learning algorithms [8], from which on-line learning with direct interactions with the user could have been proposed [9]. More recently on-line learning has been generalized to the input/output modules, SP and NLG, with protocols to control the cost of such operations during the system development (as in [10,11,12]). In this work it is our first attempt to combine the on-line learning of SP and DM in a single phase of development.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In this respect, a reinforcement learning approach based on an adversarial bandit scheme is applied [14]. If this approach has previously been used in dialogue systems for language understanding [15,16], we propose a protocol to adapt the RNN-based model on new utterances that vary from the training dataset, taking into account the cost it implies for the user to give these examples.…”
Section: Related Workmentioning
confidence: 99%
“…Lately, stochastic approaches made a breakthrough in natural language processing and are now used by state-of-the-art systems for the SLU (e.g. [3,6]) and the dialog manager (e.g. [4,5,7]).…”
Section: Online Adaptation Of Spoken Dialog Systemmentioning
confidence: 99%