Proceedings of the 28th International Conference on Computational Linguistics: Industry Track 2020
DOI: 10.18653/v1/2020.coling-industry.3
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging User Paraphrasing Behavior In Dialog Systems To Automatically Collect Annotations For Long-Tail Utterances

Abstract: In large-scale commercial dialog systems, users express the same request in a wide variety of alternative ways with a long tail of less frequent alternatives. Handling the full range of this distribution is challenging, in particular when relying on manual annotations. However, the same users also provide useful implicit feedback as they often paraphrase an utterance if the dialog system failed to understand it. We propose MARUPA, a method to leverage this type of feedback by creating annotated training exampl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 21 publications
0
8
0
Order By: Relevance
“…MARUPA (Falke et al, 2020) (Mining Annotations from User Paraphrasing) is a tool to leverage realworld user implicit feedback to collect paraphrased utterances. Sometimes when a user is having a failed interaction with the system, the user will paraphrase the utterance and retry.…”
Section: Paraphrasing By User Feedbackmentioning
confidence: 99%
“…MARUPA (Falke et al, 2020) (Mining Annotations from User Paraphrasing) is a tool to leverage realworld user implicit feedback to collect paraphrased utterances. Sometimes when a user is having a failed interaction with the system, the user will paraphrase the utterance and retry.…”
Section: Paraphrasing By User Feedbackmentioning
confidence: 99%
“…Another challenge of building a skill recommender system is noisy and hard utterances that cannot be handled by NLU. Customers express their intents in many different ways with a long tail of rare utterances (Falke et al, 2020 ) which are hard for the voice assistant to interpret. Although these utterances are rare, in aggregate their volume is huge.…”
Section: Introductionmentioning
confidence: 99%
“…Dialogue comprehension (Sun et al, 2019;Cui et al, 2020) aims to capture diverse kinds of key information in utterances, which are either scattered around or implicitly implied in different turns of conversations. Therefore, it requires different capabilities such as paraphrasing (Falke et al, 2020), summarizing (Gliwa et al, 2019), and commonsense reasoning (Arabshahi et al, 2021). Recent advances in pre-trained language models (PLMs) (Devlin et al, 2019;Radford et al, 2019) have been applied to the problem (Jin et al, 2020;Liu et al, 2021).…”
Section: Introductionmentioning
confidence: 99%