If the difference in the percentage choosing a score of 5 for a rater is
significantly higher than the mean of such differences across all raters, the rater is flagged as 5_H
(the rater awards score 5 more often than other raters). On the other hand, if the difference in the percentage
choosing a score of 5 is significantly lower than the mean across all raters, the rater is flagged as 5_L (the rater seldom uses
category 5). If a rater has both flags VR_L and 3_H (the rater uses category 3 more often than other
raters), then this rater tends to use the score categories in the middle of the scale. Individual raters
are also flagged if their ratings differ from the final scores by more than 1 point, or if their
exact agreement rate is significantly lower or higher than that of other raters. SA also summarizes
how many times a rater’s rating is discrepant from the final rating. For item types with score
categories from 0 to 5, the total number of flags can be 11; for item types with score categories from
0 to 4, the total number of flags can be 10.
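
The flagging rules in this paragraph can be illustrated with a minimal Python sketch. The text does not say which significance test SA applies, so the sketch assumes a simple z-score cutoff (the hypothetical parameter z_crit) as a stand-in. The function names, the input layout, and the rater identifiers are likewise illustrative assumptions, not part of SA.

```python
from statistics import mean, stdev


def flag_category_usage(diffs, category, z_crit=1.96):
    """Flag raters whose difference for one score category is far from the mean.

    diffs    -- dict mapping a rater id to that rater's difference in the
                percentage choosing `category` (computed upstream)
    z_crit   -- ASSUMED cutoff; stands in for the unspecified significance test
    Returns a dict mapping rater id to a flag such as "5_H" or "5_L".
    """
    values = list(diffs.values())
    m = mean(values)
    s = stdev(values)  # sample standard deviation; needs at least two raters
    flags = {}
    if s == 0:
        return flags  # all raters identical, nothing to flag
    for rater, d in diffs.items():
        z = (d - m) / s
        if z > z_crit:
            flags[rater] = f"{category}_H"  # uses the category more than others
        elif z < -z_crit:
            flags[rater] = f"{category}_L"  # seldom uses the category
    return flags


def uses_middle_of_scale(rater_flags):
    """True if a rater carries both VR_L and 3_H, as described in the text."""
    return {"VR_L", "3_H"} <= set(rater_flags)


def count_discrepant(ratings, finals, max_gap=1):
    """Count ratings that differ from the final score by more than max_gap."""
    return sum(1 for r, f in zip(ratings, finals) if abs(r - f) > max_gap)
```

For example, calling flag_category_usage on ten raters, nine with a difference of 0.0 and one with 10.0, flags only the tenth rater as 5_H under the assumed cutoff. A real implementation would substitute whatever test and threshold SA actually uses.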