It is interesting to see that for all three datasets, the best lift index is obtained when the
positive and negative examples are about equal in number (the difference with more negative examples is not statistically significant). If only 1% of training examples are positive (responders), we only need to use another 1% of the negative examples (non-responders) to form a new training set. This naturally reduces the size of the training set dramatically.