Think Twice: A Human-like Two-stage Conversational Agent for Emotional Response Generation

Qian, Yushan; Wang, Bo; Ma, Shangzhao; Wu, Bin; Zhang, Shuo; Zhao, Dongming; Huang, Kun; Hou, Yuexian

doi:10.48550/arxiv.2301.04907

Cited by 2 publications

(2 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The availability of open-source datasets allowed the creation of many transformerbased empathetic response generation models (Li et al, 2020b;Majumder et al, 2020;Zheng et al, 2021;Sabour et al, 2022), all of which directly generate empathetic responses from the input prompt. Qian et al (2023) discovered that a two-stage system (response generation and style transfer) can yield better performance than one-stage models. However, a two-stage system requires two language models to be separately trained on different datasets.…”

Section: Related Workmentioning

confidence: 99%

Conditioning on Dialog Acts improves Empathy Style Transfer

Qu,

Ungar,

Sedoc

2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

We explore the role of dialog acts in style transfer, specifically empathy style transfer -rewriting a sentence to make it more empathetic without changing its meaning. Specifically, we use two novel few-shot prompting strategies: target prompting, which only uses examples of the target style (unlike traditional prompting with source/target pairs); and dialog-actconditioned prompting, which first estimates the dialog act of the source sentence and then makes it more empathetic using few-shot examples of the same dialog act. Our study yields two key findings: (1) Target prompting typically improves empathy more effectively than pairwise prompting, while maintaining the same level of semantic similarity; (2) Dialog acts matter. Dialog-act-conditioned prompting enhances empathy while preserving both semantics and the dialog-act type. Different dialog acts benefit differently from different prompting methods, highlighting the need for further investigation of the role of dialog acts in style transfer.

show abstract

Section: Related Workmentioning

confidence: 99%

Conditioning on Dialog Acts improves Empathy Style Transfer

Qu,

Ungar,

Sedoc

2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

show abstract

“…Human judges are commonly used when evaluating the degree of empathy exhibited in a dialogue response (Zhong et al, 2020;Sabour et al, 2022;Qian et al, 2023). There has also been some work on developing empathetic response and question taxonomies, although these are only applied in small-scale or synthetic settings (Welivita and Pu, 2020;Svikhnushina et al, 2022) (Zheng et al, 2021;Majumder et al, 2022) or for automatic evaluation (Kim et al, 2021;Lee et al, 2022).…”

Section: Empathymentioning

confidence: 99%

Leveraging Large Language Models for Automated Dialogue Analysis

Finch,

Paek,

Choi

2023

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue

View full text Add to dashboard Cite

Developing high-performing dialogue systems benefits from the automatic identification of undesirable behaviors in system responses. However, detecting such behaviors remains challenging, as it draws on a breadth of general knowledge and understanding of conversational practices. Although recent research has focused on building specialized classifiers for detecting specific dialogue behaviors, the behavior coverage is still incomplete and there is a lack of testing on real-world human-bot interactions. This paper investigates the ability of a state-of-the-art large language model (LLM), ChatGPT-3.5, to perform dialogue behavior detection for nine categories in real human-bot dialogues. We aim to assess whether ChatGPT can match specialized models and approximate human performance, thereby reducing the cost of behavior detection tasks. Our findings reveal that neither specialized models nor Chat-GPT have yet achieved satisfactory results for this task, falling short of human performance. Nevertheless, ChatGPT shows promising potential and often outperforms specialized detection models. We conclude with an in-depth examination of the prevalent shortcomings of ChatGPT, offering guidance for future research to enhance LLM capabilities.

show abstract

Think Twice: A Human-like Two-stage Conversational Agent for Emotional Response Generation

Cited by 2 publications

References 37 publications

Conditioning on Dialog Acts improves Empathy Style Transfer

Conditioning on Dialog Acts improves Empathy Style Transfer

Leveraging Large Language Models for Automated Dialogue Analysis

Contact Info

Product

Resources

About