2022
DOI: 10.48550/arxiv.2210.08753
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MCP: Self-supervised Pre-training for Personalized Chatbots with Multi-level Contrastive Sampling

Abstract: Personalized chatbots focus on endowing the chatbots with a consistent personality to behave like real users and further act as personal assistants. Previous studies have explored generating implicit user profiles from the user's dialogue history for building personalized chatbots. However, these studies only use the response generation loss to train the entire model, thus it is prone to suffer from the problem of data sparsity. Besides, they overemphasize the final generated response's quality while ignoring … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…As for embedding-based methods, traditional approaches (Li et al, 2016b;Al-Rfou et al, 2016) attempt to exploit user ID information, while DHAP (Ma et al, 2021) embed user dialogue history as implicit profiles. More recently, contrastive learning (Huang et al, 2022), refined retrieval (Zhong et al, 2022) and CVAE-based clustering (Tang et al, 2023) are explored to enhance the personalization performance. However, these approaches may still suffer from the personality scarcity of real-world posts without explicit modeling.…”
Section: Personalized Response Generationmentioning
confidence: 99%
See 1 more Smart Citation
“…As for embedding-based methods, traditional approaches (Li et al, 2016b;Al-Rfou et al, 2016) attempt to exploit user ID information, while DHAP (Ma et al, 2021) embed user dialogue history as implicit profiles. More recently, contrastive learning (Huang et al, 2022), refined retrieval (Zhong et al, 2022) and CVAE-based clustering (Tang et al, 2023) are explored to enhance the personalization performance. However, these approaches may still suffer from the personality scarcity of real-world posts without explicit modeling.…”
Section: Personalized Response Generationmentioning
confidence: 99%
“…For instance, while a statement like "I grew up in the deep south" conveys traits related to regional identity, it overlooks other personality dimensions such as language style, attitudes, and inner character nuances. Other methods for personalized dialogue generation often rely on user embeddings derived from social media platforms like Reddit (Qian et al, 2021;Ma et al, 2021;Huang et al, 2022;Zhong et al, 2022). However, these models encounter challenges due to the sparsity present in real-world posts, as they lack explicit persona modeling.…”
Section: Introductionmentioning
confidence: 99%