Assumption: All terms which occur in more than 50% of the documents are not useful for any kind of classification. Thus, term-weights for these terms are set to value zero which implies that the scalar product of these terms in relation to other terms is also zero.