2001
DOI: 10.1207/s15327590ijhc1304_05
The Evaluator Effect: A Chilling Fact About Usability Evaluation Methods

Abstract: Computer professionals have a need for robust, easy-to-use usability evaluation methods (UEMs) to help them systematically improve the usability of computer artifacts. However, cognitive walkthrough (CW), heuristic evaluation (HE), and thinking-aloud study (TA), 3 of the most widely used UEMs, suffer from a substantial evaluator effect in that multiple evaluators evaluating the same interface with the same UEM detect markedly different sets of problems. A review of 11 studies of these 3 UEMs reveals that the eva…

Cited by 248 publications (112 citation statements)
References 26 publications
“…While it is possible to perform a test to determine quantitative measures like efficiency, effectiveness, and satisfaction (ISO, 1998), another common goal is to identify specific parts of a system that cause users trouble (Hertzum and Jacobsen, 2001) in order to improve the system. This is often called formative testing (Barnum, 2002).…”
Section: Introduction
confidence: 99%
“…However, research has shown that the set of identified problems depends both on the users taking part in the user test (Nielsen, 1994), and the evaluators analysing the data (Jacobsen, 1999; Hertzum and Jacobsen, 2001). Usually, products are tested with users who use these products for the first time, but it could be expected that there are differences in users' behaviour when they have become more familiar with a product.…”
Section: Introduction
confidence: 99%
“…This may be explained as an evaluator effect, since the Nielsen heuristics rating involves interpretation of the heuristics (Hertzum and Jacobsen, 2003). It is interesting to note that customer ratings, time on task measurements and mental workload measurements yield the same results, while the Nielsen heuristics rating does not.…”
Section: Layout
confidence: 92%
“…It is interesting to note that customer ratings, time on task measurements and mental workload measurements yield the same results, while the Nielsen heuristics rating does not. The latter has also been questioned in terms of the type of problems that it detects (Wixon, 2003) and of evaluator effects (Hertzum and Jacobsen, 2003; Jeffries and Desurvire, 1992; Ling and Salvendy, 2009).…”
Section: Layout
confidence: 99%