MCP: Self-supervised Pre-training for Personalized Chatbots with Multi-level Contrastive Sampling

Huang, Zhaoheng; Dou, Zhicheng; Zhu, Yuntian; Ma, Zheng-Yi

doi:10.48550/arxiv.2210.08753

Cited by 1 publication

(2 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As for embedding-based methods, traditional approaches (Li et al, 2016b;Al-Rfou et al, 2016) attempt to exploit user ID information, while DHAP (Ma et al, 2021) embed user dialogue history as implicit profiles. More recently, contrastive learning (Huang et al, 2022), refined retrieval (Zhong et al, 2022) and CVAE-based clustering (Tang et al, 2023) are explored to enhance the personalization performance. However, these approaches may still suffer from the personality scarcity of real-world posts without explicit modeling.…”

Section: Personalized Response Generationmentioning

confidence: 99%

“…For instance, while a statement like "I grew up in the deep south" conveys traits related to regional identity, it overlooks other personality dimensions such as language style, attitudes, and inner character nuances. Other methods for personalized dialogue generation often rely on user embeddings derived from social media platforms like Reddit (Qian et al, 2021;Ma et al, 2021;Huang et al, 2022;Zhong et al, 2022). However, these models encounter challenges due to the sparsity present in real-world posts, as they lack explicit persona modeling.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Miracle: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control

Lu,

Wei,

et al. 2023

Findings of the Association for Computational Linguistics: EMNLP 2023

View full text Add to dashboard Cite

Personalized dialogue systems aim to endow the chatbot agent with more anthropomorphic traits for human-like interactions. Previous approaches have explored explicitly user profile modeling using text descriptions, implicit derivation of user embeddings, or utilizing handicraft prompts for ChatGPT-like models. However, textual personas are limited in describing multi-faceted attributes (e.g., language style, inner character nuances), implicit embedding suffers from personality sparsity, and handicraft prompts lack fine-grained and stable controllability. Hence, these approaches may struggle with complex personalized dialogue generation tasks that require generating controllable responses with multiple personal attributes. To this end, we propose MIRACLE, a novel personalized dialogue generation method through MultIple PeRsonal Attributes Control within Latent-Space Energybased Models. Specifically, our approach first disentangles complex personality into multifaceted attributes. Subsequently, we employ a conditional variational auto-encoder to align with the dense personalized responses within a latent joint attribute space. We have also tailored a dedicated energy function and customized the ordinary differential equations sampling method to offer flexible attribute composition and precise attribute control. Extensive experiments demonstrate that MIRA-CLE outperforms state-of-the-art models regarding both personality controllability and response generation quality. Our dataset and code are available at https://github.com/ LZY-the-boys/MIRACLE

show abstract