“…We offer our work as a valuable foundation for improving VQA services by empowering system designers and users to prevent, interpret, or resolve answer differences. Specifically, a solution that anticipates why a visual question will lead to different answers (summarized in Figure 1) could (1) help users modify their visual question so that it yields a single, unambiguous answer; e.g., retake the image when it is low quality or does not show the answer, versus rephrase the question when it is ambiguous or invalid; (2) increase users' awareness of what reasons, if any, trigger answer differences when they are given a single answer; or (3) reveal how to automatically aggregate different answers [2,19,24,26,43] when multiple answers are collected.…”