2018
DOI: 10.1609/aaai.v32i1.11937
|View full text |Cite
|
Sign up to set email alerts
|

Addressee and Response Selection in Multi-Party Conversations With Speaker Interaction RNNs

Abstract: In this paper, we study the problem of addressee and response selection in multi-party conversations. Understanding multi-party conversations is challenging because of complex speaker interactions: multiple speakers exchange messages with each other, playing different roles (sender, addressee, observer), and these roles vary across turns. To tackle this challenge, we propose the Speaker Interaction Recurrent Neural Network (SI-RNN). Whereas the previous state-of-the-art system updated speaker embeddings only f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(15 citation statements)
references
References 28 publications
0
15
0
Order By: Relevance
“…Existing methods on building MPC systems can be generally categorized into retrieval-based approaches (Ouchi and Tsuboi, 2016;Zhang et al, 2018;Wang et al, 2020;Gu et al, 2021Gu et al, , 2023 or generation-based (Hu et al, 2019;Gu et al, 2022;Li and Zhao, 2023). On the one hand, Ouchi and Tsuboi (2016) and Zhang et al (2018) proposed to update speaker embeddings with conversation streams dynamically and role-sensitively. Wang et al (2020) proposed to track the dynamic topic in a conversation.…”
Section: Multi-party Conversationsmentioning
confidence: 99%
“…Existing methods on building MPC systems can be generally categorized into retrieval-based approaches (Ouchi and Tsuboi, 2016;Zhang et al, 2018;Wang et al, 2020;Gu et al, 2021Gu et al, , 2023 or generation-based (Hu et al, 2019;Gu et al, 2022;Li and Zhao, 2023). On the one hand, Ouchi and Tsuboi (2016) and Zhang et al (2018) proposed to update speaker embeddings with conversation streams dynamically and role-sensitively. Wang et al (2020) proposed to track the dynamic topic in a conversation.…”
Section: Multi-party Conversationsmentioning
confidence: 99%
“…Thus, the ability to predict which agent in the conversation is the most likely to speak next, and conversely, when an agent must wait before interacting, is important for conducting engaging and social conversations (Pinhanez et al, 2018;de Bayser et al, 2019). Furthermore, detecting who is being addressed, i.e., who the current speaker is talking to, is also non-trivial in these conversation scenarios (Ouchi and Tsuboi, 2016;Zhang et al, 2018;Le et al, 2019;Gu et al, 2021Gu et al, , 2023. Last but not least, only after knowing a speaker and an addressee at the current dialogue state, can the system return an appropriate response following the conversation Wang et al, 2020;Gu et al, 2022a;Li and Zhao, 2023).…”
Section: Tutorial Outlinementioning
confidence: 99%
“…Explicit Addressee Recognition This task explicitly determines the intended addressee of an utterance. Previous studies mainly focus on predicting the addressee of only the last utterance of a conversation (Ouchi and Tsuboi, 2016;Zhang et al, 2018), while recent studies pay more attention to predicting the addressees of all utterances of a conversation (Le et al, 2019;Gu et al, 2021Gu et al, , 2023. Le et al (2019) propose a who-to-whom (W2W) model to recognize and complete the addressees of all utterances in a conversation to help understand the whole conversation, given an MPC where part of the addressees are unspecified.…”
Section: Address Whommentioning
confidence: 99%
See 2 more Smart Citations