Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue 2018
DOI: 10.18653/v1/w18-5033
|View full text |Cite
|
Sign up to set email alerts
|

Conversational Image Editing: Incremental Intent Identification in a New Dialogue Task

Abstract: We present "conversational image editing", a novel real-world application domain combining dialogue, visual information, and the use of computer vision. We discuss the importance of dialogue incrementality in this task, and build various models for incremental intent identification based on deep learning and traditional classification algorithms. We show how our model based on convolutional neural networks outperforms models based on random forests, long short term memory networks, and conditional random field… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 20 publications
(11 citation statements)
references
References 35 publications
0
10
1
Order By: Relevance
“…The best accuracy reported on the aforementioned conversational image editing dataset was 74% on intent classification, ignoring actual attribute values (Manuvinakurike et al, 2018a). This result is not directly comparable to the best accuracy 61.3% on our dataset due to the difference in accuracy definition.…”
Section: Comparison To Performance On Image Editingcontrasting
confidence: 60%
“…The best accuracy reported on the aforementioned conversational image editing dataset was 74% on intent classification, ignoring actual attribute values (Manuvinakurike et al, 2018a). This result is not directly comparable to the best accuracy 61.3% on our dataset due to the difference in accuracy definition.…”
Section: Comparison To Performance On Image Editingcontrasting
confidence: 60%
“…In the literature, different types of models have been applied to the task of intent classifi-cation for incremental NLU (e.g. DeVault et al, 2009;Manuvinakurike et al, 2018;Constantin et al, 2019;Coman et al, 2019;Madureira and Schlangen, 2020, inter alia). A typical approach is to segment complete utterances into increasingly longer partial utterances.…”
Section: Previous Workmentioning
confidence: 99%
“…Vision and dialogue is an emerging topic of intersection field between computer vision and natural language processing. Conversational image editing system research [17], [18] attempts to understand the user utterance and identify the user's intention in an interactive image editing task using existing image editing software such as Adobe Photoshop and OpenCV. Our proposed method has the same motivation to identify the user's intention; however, our editing system is based on image generative models.…”
Section: Related Work a Vision And Dialoguementioning
confidence: 99%