The Selection task is a lightweight filter to find the pageURLs
in the Rankings table (1GB/node) with a pageRank above a userdefined threshold. For our experiments, we set this threshold parameter to 10, which yields approximately 36,000 records per data
file on each node.