The experiment on ideal data shows the margin for improvement when filtering noise. As a final experiment, we measured the impact of applying a simple filtering heuristic: ignoring unique URLs with low frequencies. Frequencies within a set are only meaningful in relative terms, so our approach relies on the frequency distribution and filters out URLs with a frequency in the lower nth percentile.
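
As an illustration, the percentile-based heuristic could be sketched as follows. This is a minimal sketch under stated assumptions, not the implementation used in the experiment: the function name `filter_low_frequency`, the nearest-rank percentile definition, and the choice to drop URLs whose frequency is at or below the threshold (so ties at the threshold are removed too) are all assumptions made for the example.

```python
import math


def filter_low_frequency(url_counts, n):
    """Drop URLs whose frequency falls in the lower n-th percentile.

    url_counts: dict mapping each unique URL to its frequency in the set.
    n: percentile in [0, 100].

    Assumptions (not from the original text): the percentile is computed
    with the nearest-rank method over the frequencies of the unique URLs,
    and URLs at or below the threshold value are removed.
    """
    if not 0 <= n <= 100:
        raise ValueError("n must be between 0 and 100")
    if not url_counts:
        return {}

    frequencies = sorted(url_counts.values())
    # Nearest-rank percentile: the smallest rank covering n% of the values.
    rank = math.ceil(n * len(frequencies) / 100)
    if rank == 0:
        # n == 0: nothing lies in the lower percentile, keep everything.
        return dict(url_counts)

    threshold = frequencies[rank - 1]
    return {url: count for url, count in url_counts.items() if count > threshold}


# Hypothetical URL set with made-up frequencies.
counts = {"a.example": 1, "b.example": 1, "c.example": 2,
          "d.example": 5, "e.example": 10}
kept = filter_low_frequency(counts, 40)
```

Because the threshold is derived from the set's own frequency distribution, the same value of n adapts to sets with very different absolute frequencies. Note that under the tie-handling assumed here, a set dominated by frequency-1 URLs may lose all of them even for a small n.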