Based on pioneering work carried out by Cyril Cleverdon and colleagues at Cranfield University in the 1960s (Cleverdon 1997), the popularity of test collections in IR evaluation has flourished in large part thanks to campaigns such as the Text Retrieval Conference (TREC), the CrossLanguage Evaluation Forum (CLEF), the NII Testbeds and Community for Information Access Research project (NTCIR), the Initiative for the Evaluation of XML Retrieval