Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue 2022
DOI: 10.18653/v1/2022.sigdial-1.28
|View full text |Cite
|
Sign up to set email alerts
|

GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers

Hsien-chin Lin,
Christian Geishauser,
Shutong Feng
et al.

Abstract: User simulators (USs) are commonly used to train task-oriented dialogue systems (DSs) via reinforcement learning. The interactions often take place on semantic level for efficiency, but there is still a gap from semantic actions to natural language, which causes a mismatch between training and deployment environment. Incorporating a natural language generation (NLG) module with USs during training can partly deal with this problem. However, since the policy and NLG of USs are optimised separately, these simula… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 22 publications
0
0
0
Order By: Relevance
“…These scripts include commonly used metrics such as: turn accuracy (ACC) and dialogue act F1 score for natural language understanding (NLU) (Zhu et al, 2020b), joint goal accuracy (JGA) and slot F1 score for dialogue state tracking (DST) , BLEU and slot error rate (SER) for natural language generation (NLG) (Wen et al, 2015), BLEU and Combined score (Comb.) for End2End dialogue modeeling (Mehri et al, 2019), turn accuracy, slot-value F1 score and SER for user simulators (Lin et al, 2021a(Lin et al, , 2022.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…These scripts include commonly used metrics such as: turn accuracy (ACC) and dialogue act F1 score for natural language understanding (NLU) (Zhu et al, 2020b), joint goal accuracy (JGA) and slot F1 score for dialogue state tracking (DST) , BLEU and slot error rate (SER) for natural language generation (NLG) (Wen et al, 2015), BLEU and Combined score (Comb.) for End2End dialogue modeeling (Mehri et al, 2019), turn accuracy, slot-value F1 score and SER for user simulators (Lin et al, 2021a(Lin et al, , 2022.…”
Section: Discussionmentioning
confidence: 99%
“…Besides existing models in ConvLab-2 (Zhu et al, 2020b), we integrate new transformer-based models supporting the unified data format, including SetSUMBT (van Niekerk et al, 2021) andTripPy (Heck et al, 2020) for dialogue state tracking (DST), DDPT (Geishauser et al, 2022) and LAVA (Lubis et al, 2020) for policy learning, SC-GPT for natural language generation (NLG), and SOLOIST with T5 as backbone model (Peng et al, 2022) for end-to-end modeling (End2End). We also integrate multiple powerful data-driven user simulators (US): TUS (Lin et al, 2021a) that outputs user dialogue acts, GenTUS (Lin et al, 2022) that outputs both user dialogue acts and response, and EmoUS (Lin et al, 2023) that additionally outputs emotions.…”
Section: Integrated Modelsmentioning
confidence: 99%
See 1 more Smart Citation