Interspeech 2009 2009
DOI: 10.21437/interspeech.2009-660
|View full text |Cite
|
Sign up to set email alerts
|

Hybridisation of expertise and reinforcement learning in dialogue systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
3
1
1

Relationship

3
2

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 10 publications
0
3
0
Order By: Relevance
“…The first action consists of informing the user about the price of the regal resort and the second action consists of proposing another option, Hotel Globetrotter. Performing more than one action per turn is a challenge when using reinforcement learning (Fatemi et al, 2016;Gašić et al, 2012;Pietquin et al, 2011) and, to our knowledge, this has only been done in a simulated setting (Laroche et al, 2009).…”
Section: Dialogue Managementmentioning
confidence: 99%
“…The first action consists of informing the user about the price of the regal resort and the second action consists of proposing another option, Hotel Globetrotter. Performing more than one action per turn is a challenge when using reinforcement learning (Fatemi et al, 2016;Gašić et al, 2012;Pietquin et al, 2011) and, to our knowledge, this has only been done in a simulated setting (Laroche et al, 2009).…”
Section: Dialogue Managementmentioning
confidence: 99%
“…R : S → R, the immediate reward stochastic function, defines the goal(s) 1 . In some settings such as dialogue systems (Laroche et al 2009;Lemon and Pietquin 2012) or board games (Tesauro 1995;Silver et al 2016), R can be inferred directly from the state by the agent, and in some others such as in robotics and in Atari games (Mnih et al 2013;, R is generally unknown. Finally, γ ∈ [0, 1) the discount factor is a parameter given to the RL optimisation algorithm favouring short-term rewards.…”
Section: Introductionmentioning
confidence: 99%
“…Research on negotiation dialogue experiences a growth of interest. At first, Reinforcement Learning (Sutton and Barto, 1998), the most popular framework for dialogue management in dialogue systems (Levin and Pieraccini, 1997;Laroche et al, 2009;Lemon and Pietquin, 2012), was applied to negotiation with mitigated results (English and Heeman, 2005;Georgila and Traum, 2011;Lewis et al, 2017), because the non-stationary policy of the opposing player prevents those algorithms from converging consistently.…”
Section: Introductionmentioning
confidence: 99%