1. Internal Consistency
Ratings at the category level should agree with the ratings at the element level. A high
level of consistency was found.
2. Accuracy
Participants' ratings should match the ‘correct answers’ formulated by the scenario
designers and expert raters. A high level of consistency with these answers was found at
the category level. In the more ambiguous scenarios, the level of consistency was reduced.
The largest divergence from the expert results was that the experiment participants tended
not to use the rating ‘not observed’, even when the expert rating was ‘not observed’.
3. Inter-Rater Agreement
The extent to which the different participants' ratings agree with one another. A high
level of consistency was found at the category level.
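
As an illustration of how the accuracy and inter-rater agreement criteria above could be quantified, the following is a minimal sketch using simple proportion agreement. The statistics actually used in the study are not stated in this section, so the functions, the rating labels and the data below are all hypothetical and serve only to show the idea.

```python
from itertools import combinations


def proportion_agreement(ratings_a, ratings_b):
    """Fraction of items on which two equally long rating lists are identical."""
    if len(ratings_a) != len(ratings_b):
        raise ValueError("rating lists must have the same length")
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)


def accuracy(participant_ratings, reference_ratings):
    """Agreement between one participant and the reference ('correct') ratings."""
    return proportion_agreement(participant_ratings, reference_ratings)


def mean_pairwise_agreement(all_ratings):
    """Average proportion agreement over every pair of participants."""
    pairs = list(combinations(all_ratings, 2))
    return sum(proportion_agreement(a, b) for a, b in pairs) / len(pairs)


# Hypothetical category-level ratings for four categories in one scenario.
# The labels are placeholders, not the rating scale used in the study.
expert = ["good", "poor", "not observed", "acceptable"]
participants = [
    ["good", "poor", "acceptable", "acceptable"],
    ["good", "poor", "poor", "acceptable"],
    ["good", "acceptable", "acceptable", "acceptable"],
]

accuracies = [accuracy(p, expert) for p in participants]
agreement = mean_pairwise_agreement(participants)
print(accuracies)
print(round(agreement, 3))
```

In this made-up data no participant uses 'not observed' where the expert does, which lowers every accuracy score. Proportion agreement does not correct for agreement expected by chance; a chance-corrected statistic may be preferable in practice.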