Proceedings of the 24th International Conference on World Wide Web 2015
DOI: 10.1145/2736277.2741669
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Online Evaluation of Intelligent Assistants

Abstract: Voice-activated intelligent assistants, such as Siri, Google Now, and Cortana, are prevalent on mobile devices. However, it is challenging to evaluate them due to the varied and evolving number of tasks supported, e.g., voice command, web search, and chat. Since each task may have its own procedure and a unique form of correct answers, it is expensive to evaluate each task individually. This paper is the first attempt to solve this challenge. We develop consistent and automatic approaches that can evaluate dif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
91
0
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 124 publications
(95 citation statements)
references
References 31 publications
2
91
0
1
Order By: Relevance
“…We also look at the correlation between effort and completion. An obvious finding is that user satisfaction depends on ASR quality which is consistent with previous research [123]. Hence ASR quality is a key component of user satisfaction.…”
Section: Scenarios Of Usesupporting
confidence: 89%
See 4 more Smart Citations
“…We also look at the correlation between effort and completion. An obvious finding is that user satisfaction depends on ASR quality which is consistent with previous research [123]. Hence ASR quality is a key component of user satisfaction.…”
Section: Scenarios Of Usesupporting
confidence: 89%
“…We proposed three main types of scenarios of use: (1) device control; (2) web search; and (3) structured search dialogue. The scenarios were identified on the basis of three factors: their proportional existence in the logs of a commercial intelligent assistant; the way requests are handled at the intelligent assistant backend (e. g. user requests are redirected to the different services and they serve different interfaces); and the way scenarios were defined in previous works [123]. Next, we investigated: RQ 4.2: How can we measure different aspects of user satisfaction?…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations