2019
DOI: 10.1007/978-981-13-9443-0_32
|View full text |Cite
|
Sign up to set email alerts
|

Subjective Annotation and Evaluation of Three Different Chatbots WOCHAT: Shared Task Report

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…Another important challenge is the development of automatic evaluation protocols and metrics that can be used to reduce the current approach of asking humans to provide their subjective perception and detect mistakes (Deriu et al 2019; Kong‐Vega et al 2019). The limitation of having objective metrics directly focused for dialogue systems slow down the use of current state‐of‐the‐art artificial intelligence algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…Another important challenge is the development of automatic evaluation protocols and metrics that can be used to reduce the current approach of asking humans to provide their subjective perception and detect mistakes (Deriu et al 2019; Kong‐Vega et al 2019). The limitation of having objective metrics directly focused for dialogue systems slow down the use of current state‐of‐the‐art artificial intelligence algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…Specifically, can we detect bots based on human response when data were collected under different settings and where the conversation topics differ? To address the out-of-domain research question, we use two additional datasets collected in the The Workshop on Chatbots and Conversational Agent Technologies (WOCHAT) (Kong-Vega et al, 2019). As part of a shared task, the workshop makes several bots available, has participants contribute new bots, and participants interact with the bots providing utterance-level feedback regarding their performance.…”
Section: Datasetsmentioning
confidence: 99%