It is assumed that these three IRT-based techniques would show substantial agreement in the detection of DIF among the same set of mathematics subtest items, but vary in the number of items flagged with DIF due to different assumptions and criteria used.