four reordering methods: (1) RAND, which randomly
shuffles the dataset; (2) CONF, which sorts the
records according to the values of the Conf attribute;
(3) SIGSORT, which uses the top 1000 most
frequently occurring words as signature words; and (4)
SIGSORTTSP, which uses 100 consequent records in
the sorted list obtained by SIGSORT as the candidate
set for each record (k = 100) and uses GNN heuristics
to solve the TSP. The original InvIndex is used as a
baseline.