2020
DOI: 10.1007/978-3-030-63820-7_81
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning Based Personalized Neural Dialogue Generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 5 publications
0
1
0
Order By: Relevance
“…To make the VA user-adaptive, the authors in (Saha et al, 2020c(Saha et al, ,d, 2018, proposed using a sentimentbased reward function for learning a dialogue policy in a task-oriented conversation. The authors of (Saha et al, 2020a) demonstrated how reinforcement learning may be used to generate meaningful responses while training generation frameworks. In (Saha et al, 2020b(Saha et al, , 2021a, the authors show how subtleties in human communication, such as Apart from these, several other work (Wei et al, 2019;Ide and Kawahara, 2021;Huo et al, 2020) that suggests using sentiment and/or emotion as an additional input in generation frameworks either during decoding or as reward to guide the models for generating responses aligned with the user's mood or feelings.…”
Section: Related Workmentioning
confidence: 99%
“…To make the VA user-adaptive, the authors in (Saha et al, 2020c(Saha et al, ,d, 2018, proposed using a sentimentbased reward function for learning a dialogue policy in a task-oriented conversation. The authors of (Saha et al, 2020a) demonstrated how reinforcement learning may be used to generate meaningful responses while training generation frameworks. In (Saha et al, 2020b(Saha et al, , 2021a, the authors show how subtleties in human communication, such as Apart from these, several other work (Wei et al, 2019;Ide and Kawahara, 2021;Huo et al, 2020) that suggests using sentiment and/or emotion as an additional input in generation frameworks either during decoding or as reward to guide the models for generating responses aligned with the user's mood or feelings.…”
Section: Related Workmentioning
confidence: 99%