Thus, although performance assessments are valued for their perceived high validity, performance assessments alone may not yield enough information to estimate each examinee’s proficiency level accurately. Multiple-choice items, which take less time to answer and
can be scored by machine rather than by human raters, may therefore be used to
increase the reliability of the large-scale test.