Morten Hertzum & Niels Ebbe Jacobsen (2003)
International Journal of Human-Computer Interaction, 15(1), 183-204.
DOI: https://doi.org/10.1207/s15327590ijhc1501_14
Abstract. Reviews studies of heuristic evaluation, cognitive walkthrough, and thinking-aloud testing and finds that the average agreement between any two evaluators using the same method on the same interface ranges from 5 to 65 percent. The evaluator effect is large and underappreciated in usability practice.
Tags: evaluation reliability