To make (almost) sure that the collection of frequent
sets in the sample includes all sets that really
are frequent in r, the frequency threshold is lowered
to, e.g., 1.5 %. Algorithm 1 now determines the collection
S = F(s, 1.5 %) from the sampled 20,000 rows.
Let the maximal sets of S be