Figure 3 shows the average number of features selected
on each dataset by the wrapper using naive Bayes
and by CFS. CFS generally selects a similar sized feature
set as the wrapper1. In many cases the number of
features is reduced by more than half by both methods.